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Abstract. We study bond percolation on the Hamming hypercube {0, \} m around the critical probabil- 
ity p c . It is known that if p = p c {l + 0(2 _m/3 )), then with high probability the largest connected com- 
ponent is of size 0(2 2m/3 ) and that this quantity is non-concentrated. Here we show that for any 
sequence e m such that e m = o(l) but e m » 2 _m/3 percolation on the hypercube at p c {l + e m ) has 

|<r??il = (2+o(l))£ m 2 m and |<tf 2 | = o(e m 2 m ) , 



with high probability, where ^2 is the second largest component. This resolves a conjecture of Borgs, 
Chayes, the first author, Slade and Spencer [17] . 
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1. Introduction 

Percolation on the Hamming hypercube {0, l} m is a combinatorial model proposed in 1979byErd6s 
and Spencer |23|. The study of its phase transition poses two inherent difficulties. Firstly its non- 
trivial geometry makes the combinatorial "subgraph count" techniques unavailable. Secondly, the 
critical probability where the phase transition occurs is significantly larger than l/(m- 1), making the 
method of stochastic domination by branching processes very limited. Unfortunately these are the 
two prominent techniques for obtaining scaling windows fsee e.g., l4l [T2l[T8l 1221 [301 1351 14T1 [431 I441I451 ). 

In light of the second difficulty Borgs, Chayes, the first author, Slade and Spencer (T5l [T6l [T7l sug- 
gested that the precise location p c of the phase transition is the unique solution to the equation 

E p J^(0)| = A2 w/3 . (1.1) 

where ^(0) is the connected component containing the origin, 1^(0)1 denotes its size, and A £ (0, 1) 
denotes an arbitrary constant. Later it will become clear how A is chosen. The lace expansion was 
then employed by the authors to show that at p = p c {l + 0(2" m/3 )) the largest connected component 
'ioi is of size Q(2 2m/3 ) whp — the same asymptotics as in the critical Erdos and Renyi random graph 
both with respect to the size of the cluster and the width of the scaling window (see Section fTTTl for 
more details). However, this result does not rule out the possibility that this critical behavior proceeds 
beyond the 0(2~ m/3 ) window and does not give an upper bound on the width of the scaling window. 

The authors conjectured that the giant component "emerges" just above this window (see 1 17, Con- 
jecture 3.2]). They were unable to prove this primarily because their combination of lace expansion 
and sprinkling methodology breaks for p above the scaling window. In this paper we resolve their 
conjecture: 

Theorem 1.1. Consider bond percolation on the Hamming hypercube {0,1} m withp = p c {\ + e), where 
Pc = Pc(h) with A e (0,oo) a fixed constant, and e = e m = o(l) is a positive sequence with e m » 2~ m/3 . 
Then 

l^i I 



where — ► denotes convergence in probability, and 



1, 



E|^(0)| = (4 + o(l))£ 2 m 2 m . 
Furthermore, the second largest component ^ satisfies 

\ C @2\ V 
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The main novelty of our approach is showing that large percolation clusters behave in some sense 
like uniform random sets. We use this to deduce that two large clusters tend to "clump" together and 
form a giant component. This analysis replaces the appeal to the hypercube's isoperimetric inequality 
which is key in all the previous works on this problem (see further details in Section [T31l . It essentially 
rules out the possibility that two large percolation clusters are "worst-case" sets, that is, sets which 
saturate the isoperimetric inequality (e.g., two balls of radius ml 2- \pm around the two poles of the 
hypercube). The precise behavior of the random walk on the hypercube plays a key role in prov- 
ing such statements. Our proof combines this idea with some combinatorial ideas (the "sprinkling" 
method of |3|, see Section fLBI . and ideas originating in statistical physics (Aizenman and Barsky's |1| 
differential inequalities and variants of the triangle condition). Our proof methods are general and 
apply for other families of graphs such as various expanders of high degree and high girth, finite tori 
of dimension growing with the length and products of complete graphs of any dimension (answering 
a question asked in (30]). We state our most general theorem in Section [L5l and illustrate its use with 
some examples. 

The problem of establishing a phase transition for the appearance of a component of size order 2 m 
was solved in the breakthrough work of Ajtai, Komlos and Szemeredi |3 1 . They proved that when the 
retention probability of an edge is scaled as p = elm for a fixed constant c > the model exhibits a 
phase transition: if c < 1, then the largest component has size of order m and if c> 1, then the largest 
component has size linear in 2 m , with high probability. 

At about the same time, Bollobas [12 1 initiated a study of zooming in onto the large scale properties 
of the phase transition on the Erdos and Renyi (22] random graph G{n,p) (see Section fTTTl below) . 
However, unlike G{n, p), the phase transition in the hypercube does notoccm around p = l/(deg- 1), 
where deg denotes the degree of the graph. In fact, it was shown by the first author and Slade 1 3211331 
that p c of the hypercube {0, \} m satisfies 

1 7/2 , 

Pc = - + —r + 0(m- 4 ), (1.2) 

m- 1 ra d 

Here and below we write f{m) = 0(g(m)) if \f(m)\/\g(m)\ is uniformly bounded from above by a 
positive constant, f{m) = 0(g(m)) if f{m) = 0(g(ra)) and g(ra) = 0(/(m)) and /(m) = o(g(m)) if 
f{m)lg{m) tends to with m. We also say that a sequence of events [E m ) m >i occurs with high proba- 
bility (whp) when lim OT ^ 00 P(£' m ) = 0. 

The first improvement to 1 3 1 was obtained by Bollobas, Kohayakawa and Luczak (14) . They showed 
that if p= (l + e m )/(m- 1) withe m = o(l) bute m > 60m -1 (log m) 3 , then |^i| = (2 + o{\))e m 2 m whp. In 
view of 11.21 . it is clear that one cannot improve the regime of e m in their result to more than e m > m~ 2 . 

1/3 

In (Hi, the authors show that when e m > e~ cm and p = p c {l + Em), then |^i| > ce m 2 m whp. Note 

1/3 

that e~ cm » 2~ am for any a > so the requirement on e m of Theorem II. II is much weaker. Our 
result, combined with those in (151 IS [T7], shows that it is sharp and therefore fully identifies the 
phase transition on the hypercube. 

Other models of statistical physics, such as random minimal spanning trees and bootstrap perco- 
lation on the hypercube have been studied before, we refer the reader to 1 6] (7J [471 ■ In the remainder of 
this section we present some of the necessary background and context of the result, briefly describe 
our techniques (we provide a more detailed overview of the proof in the next section) and present a 
general theorem which is used to establish scaling windows for percolation on various other graphs 
studied in the literature. 

1.1. The Erdos and Renyi random graph. Recall that G{n,p) is obtained from the complete graph by 
retaining each edge of the complete graph on n vertices with probability p and erasing it otherwise, 
independently for all edges. Write for the yth largest component obtained this way. An inspiring 
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discovery of Erdos and Renyi |22) is that this model exhibits a phase transition when p is scaled like 
p = cln. When c < 1 we have |^i | = ©(log n) whp and l^i | = Q{n) whp when c> 1. 

The investigation of the case c ~ 1, initiated by Bollobas |12 | and further studied by Luczak (4~D . 
revealed an intricate picture of the phase transition's nature. See |13| for results up to 1984, and 
(4j [35j [36l [42] for references to subsequent work. We briefly describe these here. 

The critical window. When p = (1 + 0(n~ 1/3 )) / n, for any fixed integer j > 1, 

V n 2/3""' n 2/3j 

where Cf/){ =1 are random variables supported on (0,oo), and — ► denotes convergence in distribution. 

The subcritical phase. Let e n = o(l) be a non-negative sequence with e„ » n~ 113 and put p = (1 - 
e n )ln, then, for any fixed integer j > 1, 

l^i I p 
2e^ 2 log(e^) 

Tfoe supercritical phase. Let £„ = o(l) be a non-negative sequence with e„ » n~ 113 and put p = 
(1 + e n )/n, then 



-1, 

ze n n 

and, for any fixed integer j > 2, 



1^/ 1 P , 

1. 



2£„ 2 log(4«) 

Thus, the prominent qualitative features of this phase transition are: 

(1) The emergence of the giant component occurs just above the scaling window That is, only 
in the supercritical phase we have that 1^1 « l^il, and that l^il/rc increases suddenly but 
smoothly above the critical value (in mathematical physics jargon, the phase transition is of 
second order). 

(2) Concentration of the size of the largest connected components outside the scaling window 
and non-concentration inside the window. 

(3) Duality: |^ 2 I in the supercritical phase has the same asymptotics as |^i| in the corresponding 
subcritical phase. 



Theorem 11.11 shows that (1) and (2) occurs on the hypercube (the non- concentration of at p c 
was proved in (29]). Property (3) on the hypercube remains an open problem (see Section[8l. 

1.2. Random subgraphs of transitive graphs. Let us briefly review the study of random subgraphs 
of general finite transitive graphs initiated in (T5l |T6j |T7l . We focus here only on some of the many 
results obtained in these papers. Let G be a finite transitive graph and write V for the number of 
vertices of G. Let p e [0, 1] and write G p for the random graph obtained from G by retaining each edge 
with probability p and erasing it with probability 1 - p, independently for all edges. We also write P p 
for this probability measure. We say an edge is p-open (p-closed) if was retained (erased). We say that 
a path in the graph is p-open if all of its edges are p-open. For two vertices x, y we write x y for the 
event that there exists a p-open path connecting x and y. For an integer j > 1 we write ^ for the jth 
largest component of G p (breaking ties arbitrarily) and for a vertex v we write ( v) for the component 
in G p containing v. 
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For two vertices x, y we denote 

V p (x,y) = £p p (x~ u)P p {u~ v)P p {v~ y). (1.3) 

u,v 

The quantity V p {x, y) , known as the triangle diagram, was introduced by Aizenman and Newman |2] 
to study critical percolation on high-dimensional infinite lattices. In that setting, the important fea- 
ture of an infinite graph G is whether V Pc (0,0) < oo. This condition is often referred to as the triangle 
condition. In high-dimensions, Hara and Slade |27 | proved the triangle condition. It allows to deduce 
that numerous critical exponents attain the same values as they do on an infinite regular tree, see e.g. 
(81 m [33 [39). 

When G is a finite graph, V p (0,0) is obviously finite, however, there is still a finite triangle condition 
which in turn guarantees that random critical subgraphs of G have the same geometry as random 
subgraphs of the complete graph on V vertices, where V denotes the number of vertices in G. That is, 
in the finite setting the role of the infinite regular tree is played by the complete graph. Let us make 
this heuristic formal. 

We always have that V — oo and that A e (0, 1) is a fixed constant. Let p c = p c {X) be defined by 

E Pc (A)l^(0)| = AF 1/3 . (1.4) 

The finite triangle condition is the assumption that ^ Pc {X){x,y) < l{ X =y] + a®, for some a = a (A) suf- 
ficiently small. The strong triangle condition, defined in 1 16, (1.26)] , is the statement that there exists 
a constant C such that for all p < p c , 

V p (x,y) < l [x=y] + Cx{p) 3 /V + a m , (1.5) 

where a m -*■ as m — oo and %{p) - E p 1^(0)1 denotes the expected cluster size. Throughout this 
paper, we will assume that the strong triangle condition holds. In fact, in all examples where the finite 
triangle condition is proved to hold, actually the strong triangle condition 11.51 1 is proved. In 1 16 1 , 11.51 
is shown to hold for various graphs: the complete graph, the hypercube and high- dimensional tori 
Z^. In particular, the next theorem states 111.511 for the hypercube. 

Theorem 1.2 ((H)). Consider percolation on the hypercube {0, l} m . Then for any A, there exists a con- 
stant C = C(A) > such that for any p < p c {\) (as defined in U.4\) ) 

V p (x,y)<l {x=y] + C{x(p) 3 /V+l/m). (1.6) 

As we discuss in Remark 4 below Theorem ll.31 our methodology can be used to yield a simple proof 
of Theorem l 1 .21 without relying on the lace-expansion methods derived in (TB) . The main effort in [15) 
is to show that under condition D1.5I I the phase transition behaves similarly to the one in G{n, p) de- 
scribed in the previous section. The main results obtained in 1 15 1 are the following: 

The critical window. Let G be a finite transitive graph for which J1.5I I holds. Then, for p - p c {\ + 
0(V" 1/3 )), 

PL4" 1 V 2 ' 3 < |^i | < AV 2 ' 3 ) = 1 - 0(A _1 ) . 

The subcritical phase. Let G be a finite transitive graph for which I11.5H holds. Let e = o(l) be a non- 
negative sequence with e » V~ 113 and put p = p c {l-e). Then, for all fixed 8 > 0, 

P(|^i | < (2 + <S)£r -1 log(£ 3 V)) = l- 0(1) . 

The supercritical phase. Let G be a finite transitive graph for which 11.51 holds. Let e = o(l) be a 
non-negative sequence with e » V~ 113 and put p = p c (l + e). Then, 

Pd^il > AeV) = 0{A~ l ) . 

Thus, while these results hold in a very general setting, they are incomplete. Most notably, in the 
supercritical phase there is no matching lower bound on 1^1. So, a priori, it is possible that |^i| is 



6 



REMCO VAN DER HOFSTAD AND ASAF NACHMIAS 



still of order V 213 when p = p c [\ + e) for some e » V~ 113 and that the scaling window is in fact much 
larger than V~ 113 . It remains an open problem to show whether 11.51 by itself implies that |^i|/(eV) 
converges in probability to a constant in the supercritical phase. 

As we mentioned before, the particular case of the hypercube was addressed in [171 . There the 
authors employed some of the result of Gj3[E6] together with a sprinkling argument to provide a lower 

1/3 

bound of order e2 m on 1^ | valid only when e > e~ cm . We will rely on the sprinkling method for the 
arguments in this paper, so let us briefly expand on it. 

1.3. Sprinkling. The sprinkling technique was invented by Ajtai, Komlos and Szemeredi |3| to show 
that l^i | = ®(2 m ) when p = (1 + e) / m for fixed e > and can be described as follows. Fix some small 
9 > and write p\ = (1 + (1 - 6)e)lm and p2 > delm such that (1 - pi)(l - pz) = 1 - p. It is clear that 
G p is distributed as the union of the edges in two independent copies of G pi and G P2 . The sprin- 
kling method consists of two steps. The first step is performed in G pi and uses a branching process 
comparison argument together with Azuma-Hoeffding concentration inequality to obtain that whp 
at least c 2 2 m vertices are contained in connected components of size at least 2 Cim for some small but 
fixed constants C\,c 2 > 0. In the second step we add the edges of G P2 (these are the "sprinkled" edges) 
and show that they connect many of the clusters of size at least 2 Cim into a giant cluster of size 0(2 m ). 

Let us give some details on how the last step is done. A key tool here is the isoperimetric inequality 
for the hypercube stating that two disjoint subsets of the hypercube of size at least C22 m 13 have at 
least2"Vm 100 disjointpaths of length C(C2)\/m connecting them, for some constant C(c 2 ). (The m 100 
in the denominator is not sharp, but this is immaterial as long as it is a polynomial in m.) This fact is 
used in the following way. Write V' for the set of vertices which are contained in a component of size 
at least 2 Cim in G pi so that V > C22 m . We say that sprinkling fails when |1^xl ^ C22 m /3 in the union 
G Pl u G P2 . If sprinkling fails, then we can partition V' = A i+j B such that both A and B have cardinality 
at least C22 m /3 and any path of length at most C (cz) \/m between them has an edge which is p 2 -closed. 
The number of such partitions is at most 2 2 12 1 . The probability that a path of length k has a in- 
closed edge is 1 - p\ . Applying the isoperimetric inequality and using that the paths guaranteed to 
exist by it are disjoint so that the edges in them are independent, the probability that sprinkling fails 
is at most 

111 ' 

which tends to 0. 



2 2 m /2 £ i ffl _ (j _ r^C(c 2 )^\ 2miml °° = e -2 (1+o(1))m q ? ) 

V m J 



1.4. Revised sprinkling. The sprinkling argument above is not optimal due to the use of the isoperi- 
metric inequality. It is wasteful because it assumes that large percolation clusters can be "worst-case" 
sets, that is, sets which saturate the isoperimetric inequality (e.g., two balls of radius ml2-\fm around 
two vertices at Hamming distance m) . However, it is in fact very improbable for percolation clusters to 
be similar to this kind of worst-case sets. Our approach replaces the use the isoperimetric inequality 
by proving statements showing that large percolation clusters are "close" to uniform random sets of 
similar size. It allows us to deduce that two large clusters share many closed edges with the property 
that if we open even one of them, then the two clusters connect. While previously we had paths of 
length sfrh connecting the two clusters, here we will have paths of length precisely 1. The final line of 
our proof, replacing 11.7I I. will be 

2 2eVI(k m E- 2 ) .(i_ifY ne V < e -0e 3 V(l+o(l)) ^ ^ ^ 

v m> 

where k m is some sequence with k m — ► oo very slowly. This tends to since e 3 V — ► oo. Compared 
with the logic leading to 11.71 . this line is rather suggestive. We will obtain that whp 2eV vertices are 
in components of size at least k m e~ 2 , explaining the 2 2eVI< - kmE ] term in 11.81 . The main effort in this 
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paper is to justify the second term showing that for any partition of these vertices into two sets of size 
eV, the number of closed edges between them is at least e 2 mV — the same number of edges one 
would expect two uniform random sets of size eV to have between them. Therefore, given a partition, 

i „ \me 2 V 

the probability that sprinkling fails for it is bounded by 1 - 2|J 

1.5. The general theorem. Our methods use relatively simple geometric properties of the hypercube 
and apply to a larger set of underlying graphs. We present this general setting that the majority of the 
paper assumes here and briefly discuss some other cases for which our main theorem holds asides 
from the hypercube. We remark that the impatient reader may proceed assuming the underlying 
graph is always the hypercube {0, l} m — we have set up the notation to support this since the hyper- 
cube, in some sense, is our most "difficult" example. 

The geometric conditions of our underlying graphs will be stated in terms of random walks. The 
main advantage of this approach is that these conditions are relatively easy to verify. Let G be a finite 
transitive graph on V vertices and degree m. Consider the non-backtracking random walk on it (this 
is just a simple random walk not allowed to traverse back on the edge it just came from, see Section 
I3.4l for a precise definition). For any two vertices x, y, we put p f (x, y) for the probability that the walk 
started at x visits y at time t. For any £ > 0, we write r mix (<f) for the <f - uniform mixing time of the walk, 
that is, 

r p f (x, y) + p f+1 (x, y) ,1 

I x,y 2 > 

Theorem 1.3. LetG be a transitive graph on V vertices with degree m and define p c as in fi.ifl with X = 
1/10. Assume that there exists a sequence a m = o(l) with a m >\lm such that if we put mo = T-^iam), 
then the following conditions hold: 

(1) m^oo, 

(2) [p c (m-l)] m ° = l + 0(a m ), 

(3) For any vertices x, y, 

£ £ V h {x,u)v h {u,v)v t ^v,y) = 0{a m l\ogV). 

ft + t 2 + Z3>3 

Then, 

(a) the finite triangle condition fi.51) holds (and hence the results in EES] described in Section ing 
hold), 

(b) for any sequence e = e m satisfying e m » V~ 113 ande m = oim^ 1 ), 

r^j^ 1, E|^(0)| = (4 + O (l)) £ 2 m V, ^L^O. 

2.£ m V £m* 

Remark 1. In the case of the hypercube {0, l} m we will take a m = m~ l log m and verify the conditions 
of Theorem 11.31 This is done in Section [TJ Although the behavior of random walk on the hyper- 
cube is well understood, we were not able to find an estimate on the uniform mixing time yielding 
r m ix(m _1 logm) = ©(ralogm) in the literature. To show this we use the recent paper of Fitzner and the 
first author |24| in which the non-backtracking walk transition matrix on the hypercube is analyzed. 
We use this result in Lemma 17711 to verify condition (3) and condition (2) follows directly from J1.2D 
(though condition (2) can be verified by elementary means without using U.2> . see Remark 4). 

Remark 2. Note that part (b) of Theorem l 1 .3l only applies when e m = o(m " 1 ) and not for any e m = o(l). 
Thus, for a complete proof of Theorem 11.11 we also require a separate argument dealing with the 
regime e m > cm' 1 — in the case of the hypercube and other graphs mentioned in this paper, this is a 
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much easier regime in which previous techniques based on sprinkling and isoperimetric inequalities 
are effective. 

Remark 3. Random walk conditions for percolation on finite graphs were first given by the sec- 
ond author in (44). The significant difference between the two approaches is that in |44| the con- 
dition requires controlling the random walk behavior of a period of time which is as long as the 
critical cluster diameter, that is, V 113 . The outcome is that the results of |44| only apply when p c = 
(1 + 0{V~ 1/3 ))/(m - 1) and hence do not apply in the case of the hypercube. Here we are only inter- 
ested in the behavior of the random walk up to the mixing time, even though that typical percolation 
paths are much longer. The reason for this is that it turns out that it is enough to randomize the begin- 
ning of a percolation path in order to obtain that the end point is uniformly distributed, see Section 
12.41 Another difference is that the results in |44| only show that l^i | > ceV for some c> and do not 
give the precise asymptotic value of l^i | as we do here. 

Remark 4. Our approach also enables us to give a simple proof for the fact that the finite triangle 
condition (Theorem 1 1.21 1 holds for the hypercube without using the lace expansion as in |16|. Our 
proof of this fact relies on the estimate p c = l/(m - 1) + 0(m~ 3 ) (which is much weaker than 11. H but 
also much easier to prove) and on the argument presented in Section l2~4l We defer this derivation to 
a future publication. In fact, in this paper we only rely on this easy estimate for p c so our main results 
here, Theorem ll.il are in fact self-contained and do not rely at any time on results obtained via the 
lace expansion in (T6) (we do use arguments of H5) which rely on the triangle condition). We hope 
that this may be useful in future attempts to "unlace" the lace expansion. 

In many cases, verifying the conditions of Theorem 11.31 is done using known methods from the 
theory of percolation and random walks (note that condition (2) involves both a random walk and 
a percolation estimate). We illustrate how to perform this in Section [7] in the case of the hypercube 
(thus proving Theorem ll.lt and for expander families of high degree and high girth (see (34) for an 
introduction to expanders). This is a class of graphs that contains various examples such as Payley 
graphs (see e.g. (20)), products of complete graphs K„ and many others. Percolation on products 
of complete graphs were studied in |30, [3TJ [44] in the cases d = 2,3; our expander theorem allows 
us to provide a complete description of the phase transition in any fixed dimension d, answering a 
question posed in (30] . Recall that a sequence of graphs G n is called an expander family if there exists 
a constant c> such that the second largest eigenvalue of the transition matrix of the simple random 
walk is at most 1 - c (the largest eigenvalue is 1). Also, the girth of a graph is the length of the shortest 
cycle. It is a classical fact that on expanders r m ix(V _1 ) = 0{\ogV), where V is the number of vertices 
of the graph, see e.g. |5, below (19)]. 

Theorem 1.4. LetG m be a transitive family of expanders with degree m — ► oo and V vertices. Assume 
that m > clog V and that the girth ofG is at least clog^^ V for some fixed c > 0. Then the conditions 
of Theorem \ 1 .31 hold and hence the conclusions of that theorem hold. 

For products of complete graphs K%, the girth equals 3, V = n d and m = d{n - 1), so that the girth 
assumption is satisfied for c < 3(1 - o(l))/d and n sufficiently large. Theorem 11.31 applies to other 
examples of graphs, not included in the last theorem, for example, products of complete graphs K„ 
where d may depend on n (as long as n + d -*■ oo) and finite tori but only when d = d{n) grows at 
some rate with n. We omit the details since they are rather similar. We emphasize, however, that there 
are important examples which are methods are insufficient to solve. Most prominently are bounded 
degree expanders with low girth and finite tori where d is large but fixed. It seems that new ideas 
are required to study percolation on these graphs, see Section[8] 
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1.6. Organization. This paper is organized as follows. In Section[2]we give an overview of our proof, 
stating the main results upon which the proof is based. In Section |3] we prove several estimates on 
the number of vertices satisfying various properties, such as having large clusters, or surviving up to 
great depth. We further prove detailed estimates on connection probabilities. In Section |4] we prove 
expected volume estimates, both in the critical as well as in the supercritical regime. In Section[5]we 
prove an intrinsic-metric regularity theorem, showing that most vertices that survive long and have 
a large cluster size have neighborhoods that are sufficiently regular. In Section[6]we show that most 
large clusters have many closed edges between them, which is the main result in our proof. In Section 
Owe perform the improved sprinkling argument as indicated in Section [T~4"l and complete the proof 
ofTheorem ll.il In Section[8]we discuss several open problems. We close the paper in In Appendix|A| 
we sharpen the arguments in |8 | and 1 16 1 to obtain the asymptotics of the supercritical cluster tail. 

1.7. Acknowledgements. The work of RvdH was supported in part by the Netherlands Organization 
for Scientific Research (NWO). The work of AN was partially supported by NSF grant #6923910. RvdH 
warmly thanks the Theory Group at Microsoft Research Redmond for the hospitality during his visit in 
January 2010, where this work was initiated, the Department of Mathematics at M.I.T for their hospi- 
tality during his visit in April 2011, and the Department of Mathematics at U.B.C. for their hospitality 
during his visit in November 2011. We thank Gady Kozma for useful discussions and Gordon Slade for 
valuable comments on a preliminary version of this paper. AN thanks Christian Borgs for introducing 
him to (T71 Conjecture 3.2] when he was an intern at Microsoft Research in the summer of 2005. 

2. Overview of the proof 

In this section we give an overview of the key steps in our proof. Throughout the rest of the paper 
we assume that e = e m is a sequence such that e = o(l) but e 3 V —>■ oo. 

2.1. Notations and tools. Let G be a transitive graph and recall that G p is obtained from G by inde- 
pendently retaining each edge with probability p. Recall that x «-»• y denotes that there exists a p-open 
path connecting x and y. We write d Gp (x, y) for the length of a shortest p-open path between x, y and 

put dc p {x,y) = oo if x is not connected to y in G p . We write x «-»• y if d G ix,y) < r, we write x *-> y 
if dc„ (x, y) = r and x y if d Gp (x, y) e [a, b] . Further, we write x ^-^l y when there exists an open 
path of length in [a, b] between x and y (not necessarily a shortest path). The event {x y} is not 

P[a b] 

increasing with respect to adding edges, but the event {x < — y) is, which often makes it easier to deal 
with. Whenever the sign appears it will be clear what p is and we will drop it from the notation. 
The intrinsic metricbdW of radius r around x and its boundary are defined by 

B^(r) = {y: d Gp {x,y)<r}, <3B^(r) = {y: d Gp {x,y) = r}. 

Note that these are random sets of the graph and not the balls in shortest path metric of the graph 
G. We often drop the G from the above notation and write B x (r) when it is clear what the underlying 
graph G is. We also denote 

B^([a,b]) = {y:x l ^y}. 

Our graphs always contain a marked vertex that we call the origin and denote by 0. In the case of the 
hypercube this is taken to be the all vector. We often drop from notation and write B{r) for Bo(r) 
whenever possible. 

We now define the intrinsic metric one-arm event. This was introduced in [46] to study the mixing 
time of critical G{n, p) clusters and was very useful in the context of high-dimensional percolation in 
|37|. Define the event 

H G {r) = {dB^(r)^0}, 
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for any integer r > and 

r(r) = supP(H G '(r)), (2.1) 

G'cG 

where the supremum is over all subgraphs G' of G. 

The reason for the somewhat unnatural definition of T above is the fact that the event dB^{r) ^ is 
not monotone with respect to the addition of edges. Indeed, turning an edge from closed to open may 
shorten a shortest path rendering the configuration such that the event dB G (r) ^ no longer occurs. 

The following theorem studies the survival probability and expected ball sizes at p c , and is the finite 
graph analogue of a theorem of Kozma and the second author |37|. The proof is almost identical to 
the one in E3 and is stated explicitly in (291138) . 

Theorem 2.1 (Volume and survival probability (37)). Let G be a finite transitive graph on V vertices 
such that the finite triangle condition 17.51) holds, and consider percolation onG at p = p c (A) with any 
A > 0. Then there exists a constant C = C(A) > such that for any r > 0, 

(1) E\B(r)\ < Cr , and 

(2) T(r) <C/r. 

We often need to consider percolation performed at different values of p. We write P p and E p for the 
probability distribution and the corresponding expectation operator with parameter p when neces- 
sary. Furthermore, we sometimes need to consider percolation configurations at different p's on the 
same probability space. This is a standard procedure called the simultaneous coupling and it works 
as follows. For each edge e of our graph G, we draw an independent uniform random variable U (e) 
in [0, 1]. We say that the edge e receives the value U{e). For any p e [0, 1], the set of p-open edges is 
distributed precisely as {e : U (e) < p}. In this way G pi c G P2 with probability 1 whenever p\ < p 2 . 

2.2. Tails of the supercritical cluster size. We start by describing the tail of the cluster size in the 
supercritical regime. Note that the following theorem requires only the finite triangle condition, and 
not the stronger assumptions of Theorem ll.3l and so the restriction e = o(m ) is not needed. 

Theorem 2.2 (Bounds on the cluster tail). Let G be a finite transitive graph of degree m onV vertices 
such that the finite triangle condition 0.51) holds and put p = p c [l + e) where e = o(l) ande » V~ 113 . 
Then, for any k satisfying k » e~ 2 

P(|^(0)| > A;) < 2e(l + 0(e+(e 3 V)~ 1 + (E 2 k)~ 114 + a m )), (2.2) 

and, for the sequence k = e~ 2 [e 3 V) a for any a e (0, 1 /3), there exists ac = c{a] > such that 

P(|^(0)| > /c ) > 2e{\ + 0(e+{e 3 vr c + a m )) . (2.3) 

Theorem l2.2l is reminiscent of the fact that a branching process with Poisson progeny distribution 
of mean 1 + ehas survival probability of 2e(l + 0(e)) when e = o(l). Upper and lower bounds of order e 
for the cluster tail were proved already in 1 16| using Barsky and Aizenman's differential inequalities 1 8| . 
However, to get the precise constant 2 we need to sharpen these differential inequalities and handle 
some error terms in them that were neglected in the past. This derivation and the proof of Theorem 
I2.2l are presented in Appendix A. Its proof is entirely self-contained. 

Let Z>it denote the number of vertices with cluster size at least k, i.e., 

Zz k =\{v: \<£[v)\>k}\. (2.4) 

We use Theorem l2.2l to show that Z> ko , with k as in the theorem, is concentrated. This advances us 
towards the first term on the left hand side of 111.81 . 
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Lemma 2.3 (Concentration of Z> fco ). In setting of Theorem\2^ ifm —- oo, then 

2cV^ ' 

and 

E|^(0)| < {4 + o(l))e 2 V. 

2.3. Many boundary edges between large clusters. The term (l - ^) me V in 11.81 suggests that af- 
ter partitioning the large clusters into two sets of order eV vertices, as we did before, the number of 
closed edges connecting them is of order me 2 V. This is the content of Theorem l2.4l below which is the 
main effort of this paper. It is rather intuitive if one believes that large clusters are uniform random 
sets. Indeed, let v be a vertex in one of the sets of the partition. It has degree m and hence we expect 
me of these neighbors to belong to the second set of the partition. Summing over all vertices v we 
obtain of the order e 2 mV edges. Making this a precise statement requires some details which we now 
provide. 

We work under the general assumptions of Theorem 11.31 In particular, we are given sequences 
e m ,a m such that both are o(l) and e 3 m V — >■ oo and a m > l/m. Without loss of generality we assume 
that 

a m >(e 3 m VV 112 , (2.5) 

otherwise we replace the original a m by i£ 3 m V)~ 1/2 , and note that in both cases a m = o(l) and satisfies 
a m > l/m. 

Let us start by introducing some notation. For vertices x, y and radii j x , j y , we define the event 

y- Jx, jy) = idB x (jx) ? 0,dB y (jy) ? and B x (j x ) n B y {j y ) = 0} (2.6) 

Intuitively, if srf{x,y, j x ,j y ) occurs for j x and j y sufficiently large, then x and y are both in the gi- 
ant component. The event sd {x, y, j x ,j y ) plays a central role throughout our paper. We continue by 
choosing some parameters. The role of each will become clear later. We put 

M=M m = logloglog(e 3 yAa m 1 A(em ) _1 ) and r = r m = M m e~ 1 . (2.7) 

Note that M m —> oo in our setting. We choose r by 

r = —\og(a m e 3 V). (2.8) 

In Corollary |43] we prove that E\B(j) | = 0(£ _1 (1 + e) j ) as long as ;' < e~ l [log(e 3 lO - 4loglog(e 3 l/)] — 
the same asymptotics as in a Poisson (1 + e) branching process (though in the branching process the 
estimate is valid for all j). This implies that E|B(ro)| = ®[\/a m eV), a fact that we use throughout the 
paper, but we do not use this right now. 
For vertices x, y and radius £ we define 

S({x,y) -\{{u,u') eE{G) : {x u) o {y u'}, 

\B u {2r + r )| ■ \B u ,{2r + r )| < e 40M c- 2 (E|5(r )|) 2 }| . 

The edges counted in Sg{x, y) are the ones that we are going to sprinkle. Informally, a pair of vertices 
(x, y) is good when their clusters are large and S{{x, y) is large, so that their clusters have many bonds 
between them. We make this quantitative in the following definition: 

Definition2.1 ((r, r )-good pairs). We say that x,y are [r, r ) -good if all of the following occur: 

(1) s4{x,y,2r,2T), 

(2) \*g{x)\ > (e 3 V) ll4 e~ 2 and\^(y)\ > (e 3 V) ll4 e- 2 , 

(3) S 2r+ro {x,y) > (logM)- 1 y- 1 m£- 2 (E| J B(r )|) 2 . 
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Write P r>ro for the number of{r, r ) -good pairs. 

Theorem 2.4 (Most large clusters share many boundary edges). Let G be a graph on V vertices and 
degree m satisfying the assumptions in Theorem lLR Assume thate = e m satisfies 

e m »V~ 113 and £ ra = o(m _1 ), (2.9) 

as in part (b) ofTheorem \1.3[ Take M and r = Me' 1 as in 1 12.71 1, and ro as in D2.8I I. Then, 

{2eV) 2 

In light of Theorem l2.21 we expect that the number of pairs of vertices [x,y] with |^(x)| > (e 3 V) ll4 e~ 2 
and |^(y)| > [e 3 V) ll4 e~ 2 is close to {2eV) 2 . Theorem 12.41 shows that the majority of these pairs have 
clusters that share many edges between them. This allows us to proceed with the sprinkling argument 
leading to 11.81 and we perform this in Section lTTTl leading to the proof of Theorem ll.31 Since the latter 
proof assumes only Theorem 12.41 the curious reader may skip now to Section [77T1 to see how this is 
done. 

2.4. Uniform connection bounds and the role of the random walk. We briefly expand here on one 
of our most useful percolation inequalities and its connection with random walks. In the analysis 
of the Erdos-Renyi random graph G{n,p) symmetry plays a special role. One instance of this sym- 
metry is that the function f{x] = P(0 <->■ x) is constant whenever x ^ and its value is precisely 
(V - 1)" 1 (E| < ^(0)| - 1) and 1 when x = 0. Such a statement clearly does not hold on the hypercube 
at p c : the probability that two neighbors are connected is at least m" 1 (recall 1 11.21 and the probabil- 
ity that is connected to one of the vertices in the barycenter of the cube is at most \/m2~ m E\ c to{0)\ 
by symmetry. 

A key observation in our proof is that one can recover this symmetry as long as we require the 
connecting paths to be longer than the mixing time of the random walk. A precise statement is that 
percolation at p c on any graph G satisfying the assumptions of Theorem ll.31 

P(0^x)< (1 + (1))^1, (2.10) 

where mo is uniform mixing time, as defined in Theorem ll.31 This is the content of Lemma l3.13l (or 
rather by taking r — oo in the lemma). This gives us an advantage at estimating difficult sums such as 
V p (0,0) as in (II .3D . see Section lBTBl 

2.5. Sketch of proof of Theorem l2.4i The difficulty in Theorem l2.4l is the requirement (3) in Defini- 
tion [27TJ Indeed, conditioned on survival (that is, on the event si{x, y,2r,2r)), the random variable 
S2r+r {x, y) is not concentrated and hence it is hard to prove that it is large with high probability. In 
fact, even the variable \B{r ) | is not concentrated. This is not a surprising fact: the number of descen- 
dants at generation n of a branching process with mean p > 1 divided by p n converges as n —>■ oo to a 
non-trivial random variable. Intuitively, this non- concentration occurs because the first generations 
of the process have a strong and lasting effect on the future of the population. 

In order to counteract this, we condition on the event si (x, y, r, r) and on the entire balls B x {r) and 
B y {r) including all the open and closed edges touching them (during the actual proof we will use 
some other radii j x , j y between r and 2r but this is a technical matter). We will prove that given this 
conditioning the variable S r+ro (x, y) is concentrated around the value 

\dBAr)\\dB y {r)\V- l mmB{r Q )\) 2 , (2.11) 

and that \dB x {r)\\dB y {r)\ > e" 2 with high probability, yielding that requirement (3) in Definition 12. II 
occurs with high probability conditioned on the event above. Our choice of r in ( I2.8D is made in such 
a way that the above quantity is large (however, later we will see that r cannot be too large). Let us 
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elaborate on the estimate 12.11L Assume that B x {r) = A and B y {r) = B and write ~Pa,b an d Ea,b for the 
conditional probability and expectation given B x (r) = A and B y {r) = B. We have 

E A , B S r+ro (x,y) a Y, Y p AB(ia+^ u}o{b<& u'}), 

aedA,bedB (u,u') 

where we did not write equality because a) we ignored the second condition in the definition of 
S r+ro {x,y) on \B u [2r + r )| • \B u r(2r + r )|, and b) some edges {u, u') maybe over-counted in the sum, 
and c) we have neglected to count the closed edges {u, u') that connect A and B (that is, occurring 
in height smaller than r). However, it turns out that all of these contributions are small compared to 
12.11I I. It is a standard matter by now to manipulate, using the triangle condition, to obtain that for 
most edges {u, u') 

Pa,b({<2^ u}o{b^ u'}) ~P a ,b{ci^ u)V A } s{b *—* u'), 
so in order to proceed we need a good lower bound on Va,b{o> u). Lemma [3TT31 (see also 12.101 
immediately gives that P(a *^* u) > (1 - o(l))y _1 E|B(r )| for most vertices u (since Z M P(a u) = 
E|B(r )|). Had we had the same estimate for Pa,b(^ ~ u), the lower bound on the conditional first 
moment required to prove the estimate 12.111 1 would follow immediately. However, the probability 
Pa,b(« *^* u) may heavily depend on the sets A and B. 

To that aim, in Section [5] we present an intrinsic metric regularity theorem, similar in spirit to the 
extrinsic metric regularity theorem presented in (39] . Roughly, it states that for most sets A (more 
precisely, the weight of sets not having this is o[e)) for which B x {r) = A satisfies dB x {r) ^ 0, we have 
that most vertices aedA satisfy 

2> A (aA M )>(i- (i))E|fl(r )|, 

u 

where is the conditional probability given B x {r) = A. Thus, the expected size of the "future" of 
most vertices on the boundary is not affected by the conditioning on a typical "past". 

At this point comes another crucial application of the uniform connection bounds described in the 
section above. Indeed, even if the expected "future" of a vertex has the same asymptotics with or with- 
out conditioning, we cannot a priori rule out the possibility that this conditional "future" concentrates 
on a small remote portion of the underlying graph G — this can potentially violate the concentration 
around the value in 12.111 . However, our uniform connection bounds stated in Lemma l3.13l are robust 
enough to deal with conditioning and immediately imply that V A {a *^* u) = (1 - o{l))V~ l E\B{r ) \ for 
most a in dA and for most vertices u. In other words, not only did the conditioning not influence the 
size of the "future", it also left its distribution approximately unaltered. These considerations allow 
us to give a lower bound of J2.1U on the conditional expectation. This and the conditional second 
moment calculation required to show concentration are performed in Section[6l 

3. Preliminaries 

In this section we provide some preliminary results that we will use. These involve various expecta- 
tions and probabilities related to the random variable \dB{r) \ in Section l3T2l and l331 non-backtracking 
random walks in Section [3~4l and its relation to uniform bounds for connection probabilities in Sec- 
tion !3.5[ In Section [3T61 we use these results to prove part (a) of Theorem 1 1.31 Finally, in Section 13771 
we bound triangle and square diagrams. The results in this section do not rely on the assumptions of 
Theorem ll.3l but sometimes we do assume the finite triangle condition 11.51 1. 

3.1. The "off" method and BK-Reimer inequality. We will frequently handle the events dB{r) ^ 

and x<^j/. These events are non monotone with respect to adding edges, indeed, adding an edge may 
shorten a shortest path and prevent the events from holding. This non-monotonicity is a technical 
difficulty which unfortunately manifests itself in many of the arguments in this paper. Our main tools 
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to deal with this problem are the BK-Reimer inequality [TO EQ] and the notion of events occurring 
"off" a set of vertices. For the BK-Reimer inequality we use the formulation in (T9) . 

For a subset of vertices A, we say that an event Jl occurs off A, intuitively if it occurs in G p \ A. 
Formally for a percolation configuration co, we write (±>a for the configuration obtained from co by 
turning all the edges touching A to closed. The event "Jl occurs off A" is defined to be {oj : w^e Jl}. 
We also frequently write P ff a to denote the measure V Q a A {Jl) = P p {Jl off A). Equivalently P ff a 
can be thought of as a percolation measure in which all edges touching A are closed with probability 
1 and the rest are distributed independently as before. We often drop p from the notation when it 
is clear what p is. This framework also allows us to address the case when A = A{co) is a random set 
measurable with respect to G p (the most prominent example is A = B (r)). In this case, the event 
{Jl occurs off A{(d)} is defined to be 

{Jl occurs off A((o)} = {a) : (i)a(w) e Jl) ■ 

Let us review an example occurring frequently in our arguments in which Jl is an arbitrary event 
and A - B x {s). In this case, 

P{Jl of£B x {s)) = £P(B x (s) = A)P(Jl off A), (3.1) 

A 

where we have used the fact that 

P(Jl off B x (s) I B x (s) = A) = P{Jl off A), 

since the events do not depend on edges touching A in both sides of the equation, and the marginal 
of the two distributions on the edges not touching A is the same product measure. In terms of this 
notation, for a subset of vertices A, we define 

B^(r;A) = {y: d Gp {x,y) < r off A}, dB%{r;A) = {y: d Gp {x,y) = r off A} 

to be the intrinsic ball off A and its boundary. We finally say that Jl occurs only on A'xiJl occurs but 
Jl off A does not occur. We frequently rely on the following inclusion: 

Claim 3.1. For any event Jl and any subset of vertices A<zV, 

Jl \ {Jl only on A} c {Jl off A} . 

Proof. By definition of "Jl only on A" the event on the left hand side equals 

Jln{Jl c u{Jl off A}}. 

From this, it is easy to see that this event implies Jl off A. □ 

Remark. Equality in Claim lBTTl does not hold (unless the right hand side is replaced by Jl n {Jl off A}) . 
This can easily be seen by taking a non-monotone event, say dB x {r) ^ 0. 

The following lemmas are inequalities valid for any graph G and any p. 
Lemma 3.2 (Disjoint survival). For any graph G, pe [0, 1], vertices x, y, z and integers r, s, 

P(dB x (r) ^ 0,dB y (s) 0,B x (r)nB y (s) = 0) < P(dB x (r) ? 0) max P off A (dB y (s) ^ 0) . 

A<zV 

Proof We condition on B x (r) = A such that A satisfies dB x (r) ^ and P(5 x (r) = A) > 0. The left hand 
side then equals 

X P(B x (r) = A)P(dB y (s) ji off A) , 

A: dB x {r)?0 

as in 1 13.11 1 . The lemma now follows. □ 
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Lemma 3.3 (Tree-graph bound revisited) . For any graph G, p e [0, 1] , vertices u, v and integers r, £ > 0, 
P(0 ~ u and ~ v) < Y Y P(0 ~ z)P(z <^ i/) maxP off A (z ~ a) . 

Proo/ We claim that if <— * u and — * v, then there exist zeV and f < r such that the two following 
events occur disjointly 

(a) There exists a shortest open path r\ of length r between and u such that 77(f) = z, and 

(b) There exist an open path between z and £ of length at most £. 

Indeed, if the event occurs, let r\ be the lexicographical first shortest path of length r between and 
u and let y be an open path of length at most £ between and v. We take z to be the last vertex on y 
which belongs to r\ and put t such that 77(f) = z. The witness for the first event is the set of open edges 
of 77 together with all the closed edges in G p (the closed edges determine that r] is a shortest path) 
and the second witness is the edges of 7 from z to v. These are disjoint witnesses so we may use the 
BK-Reimer inequality and bound 

P(0~ uandO^ v) < £ P((a))P(z ~ v) . 

zeV,t<r 

To bound P((a)) we condition on B {t) = A such that A satisfies *-> z, so 

P((a))= £ P(B (t) = Amz^u off A), 

and the lemma follows. □ 

3.2. Survival probabilities. In this section, we prove Lemma IZ3l and a few other useful estimates of 
a similar nature. In the rest of this section we only rely on the finite triangle condition 111.5I I. Theorem 
I2.1l and Theorem l2.2l (both follow from the triangle condition). 

Lemma 3.4 (Relating connection probabilities for different p's). Let p\,p2 e [0, 1] satisfy p\ < P2 and 
letr>0 be integer. The following bounds hold for any graph G and vertex v: 

(1) V p2 {dB v {r) ji 0) < [^) r p pi (dB v (r] ± 0), 

(2) E p2 \dB v (r)\ <(^)' E pi \dB v {r)\. 

Proof. We recall the standard simultaneous coupling between percolation measure at different p's 
discussed in Section lZTl Order all the paths in G of length r starting at v lexicographically. Write sd 
for the event that dB v {r) ^ in G P2 and that the lexicographical first p 2 -open shortest path of length 
r starting at v is in fact pi-open. We claim that 

P(*/) = (— )'p(dBv(r)^0mG P2 ). (3.2) 

Indeed, conditioned on the edges of G P2 , the value U (e) of each edge in G P2 is distributed uniformly on 
the interval [0, p 2 ]. Hence, the probability of the first shortest path being pi-openinthis conditioning 
is precisely {p\lp2) r , which proves 03 -2D . To see (1), note that if the first p 2 -open shortest path is p\- 
open, then it is a shortest path of length r in G pi , so that sd implies that dB v (r) ^ in G pi whence 

V pi {dB v {r) ? 0) > (^ r p p2 (dB v {r) ? 0). 

The proof of (2) is similar and we omit the details. □ 
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Corollary 3.5 (Correlation length is lie). LetG be a transitive finite graph for which U.5D holdsandput 
p = p c (l + e). The following bounds hold for any subset of vertices A and any vertex v and any integer 
r: 

(1) P off A [dB v (e~ l ) ?0) = 0(e), and 

(2) E\B v (r;A)\ = 0(r(l + e) r ). 

Proof. The result is immediate by combining Lemma l3.4l and Theorem l2.11 □ 

Remark. In Section|4]we will show a sharp estimate replacing (2) in the above corollary. 

Lemma 3.6 (Supercritical survival probability). Let G be a transitive finite graph for which 17.51) holds 
andputp = p c {l + e). Then, for any M — • oo and any subset of vertices A, 

Poff a &B (Me' 1 ) ?0)< (2 + o(l))e, 
and, for any M < loglogte 3 V) such that M — ► oo, 

P(dB(Me~ 1 ) ? 0) > (2- o(l))e. 
Proof. To prove the upper bound we write 

P off A (dB(Me~ l ) ji0) = P of f A (dB(Me- 1 )?0,\^(O)\>s/Me- 2 ) 

+P oiiA (dB(Me- 1 ) ? 0, |^(0)| < ^Me~ 2 ) . 

The first term on the right hand side is at most (2 + o(l))e by Theorem 12.21 — note that we used the 
fact that the event | ^ (0) | > k is monotone so P ff a (I ^ (0) I > k) < P(| ^ (0) | > k) . It remains to show that 
the second termiso(£). Indeed, if this event occurs, then there exists a radius j e [Me -1 l3,2Me~ l /3] 
such that 

0<dB(j;A) <3M~ ll2 e~ l and dB(Me~ 1 ;A) ?0. 
Let / be the first level satisfying this. By Corollary l3.5l and the union bound 

Poff A {3 y edB(J;A) with dB y (Me~ 1 /3;A) ? off B(J;A) | B(J;A)) < Ce\dB(J;A)\ = 0(M~ 112 ) . 

Corollary l3.5l also shows that P ff A (dB(Me~ 1 13) ^ 0) = 0(e), so putting this together gives 

P ff A (dB(Me~ l ) ^0,\ c €(x)\ < \fMe~ 2 ) = 0(M~ ll2 e), (3.3) 

concluding the proof of the upper bound. For the lower bound, take k = e~ 2 (e 3 V) a for some fixed 
a e (0,1/3). We have 

V(dB(Me~ l ) ^ 0) > V(dB(Me~ l ) ^0 and |^(0)| > A; ) 

= P(|^(0)| > fc o )-P(|^(0)| > k and dB(Me~ l ) = 0), 

so by Theorem 12.21 it suffices to bound the last term on the right hand side from above. Indeed, by 
Markov's inequality and Corollary l3.51 

P(|^(0)| > A; anddB(Me~ 1 ) = 0) < P(|B(Me _1 )| > k ) (3.4) 

CMe M e~ 1 



o[e(e 3 Vr a log(e 3 V)] = o(e), 



k Q 

since M< loglog(£ 3 V). □ 

We proceed with preparations towards the proof of Lemma [231 For an integer r > 0, we write N r 
for the random variable 

N r = \{x:dB x (r)^0}\. 

We think of lie as the correlation length, see |26|. In other words, when r » lie, then the vertices 
contributing to N r should be those in the giant component. 
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Lemma 3.7 (N r is concentrated). Let G m be a sequence of transitive finite graph with degree m for 
which (1.50 holds and m^oo. Putp- p c (l + e), then for any r » l/e satisfying r < e~ 1 \og\og[e 3 V), 

N r p 



2cV 

Proof. We use a second moment method on N r . Lemma l3^6l and our assumption on r shows that 

EN r = (2 + o(l))eV. 

The second moment is 



EN 2 = £P(dB x (r) ? and dB y {r) * 0) . 

x,y 



We have 



P(dB x (r)^0anddB y (r)^0) < P(dB x (r) ji and dB y (r) ^ 0,B x {r) n B y (r) = 0) 

+P(x~y). 

We sum the first term on the right hand side using Lemmas 13.21 and 13.61 and the second term using 
Corollary|3J3 We get that 

EN 2 < (4 + o(l))c 2 V 2 + 0(Vr(l + e) 2r ) = (1 + o(l)) [EN r ] 2 , 

since Vr[l + e) 2r = o{e 2 V 2 ) by our assumption on r and since e 3 V — ► oo. The assertion of the lemma 
now follows by Chebychev's inequality. □ 

Proof of Lemma l2.3l Take M = M m and r as in 1 12.71 1 and write 

Z, kQ =N r + \{x: aB x (r) = 0,|^(x)|>A;o}|-|{x: dB x (r) ? 0, \^(x)\ < k }\ . 

By Lemma 13771 N r /(2eV) — ► 1, so it suffices to show that the expectation of both remaining terms is 
o{eV). The expectation of the first term is 

VP(x: dB x {r) = 0, mv)\ > k ) = o(eV), (3.5) 

by 13.41 . The expectation of the second term now must be o{eV) since both N r and Z> fco have mean 

(2 + o(l))£l/ by Theorem l272l and Lemma l3.61 This shows that Z> kQ /{2eV) — ► 1 as stated in Lemma l2~73l 
To prove the upper bound on E|^(0) | we write 



By Corollary|331 



E|«* (0)| = £p(o ~ y) = £p(0 — y) + Ep(o ^ y) . 

y y y 



£P(0 — y) = E|B(2r)| < Ce^logVl/) = o(£ 2 l0, (3.6) 

y 



'-j . i , TP _ [2/',oo) , , 

since rl/» 1. It < ► y, then the event 

{dBo(r) ? 0,dB y (r) ? 0,B o {r) n B y (r) = 0} , 

occurs. Hence Lemmas 13721 and [3T61 give that P(0 y) < (4 + o(l))e 2 and summing this over y gives 
the required upper bound on E|S^(0)|. □ 
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3.3. Disjoint survival probabilities. In this section we show that for most pairs x, y the event sd{x,y, r, r) 
occurs with probability asymptotic to Ae 2 . The point is that r is chosen in such that r » e~ l , where 
£ _1 is the correlation length, but r « e~ l log(£ 3 V) which is the diameter of S^i. 

Lemma 3.8 (Number of pairs surviving disjointly). LetG m be a sequence of transitive finite graph with 
degree m for which fi.51) holds and m -*■ oo. Put p = p c {l + e), then for any r » £ _1 satisfying r < 
£r _1 loglog(£ 3 V) 

| {x, y : sd (x, y, r, r) occurs} | P 
(2eV) 2 " 

Proof. Define 

Nf ] = | {x, y : si (x, y, r, r) occurs} | . 

Then, 

N 2 -\{x,y: x~y}\<N { ?<N 2 r , 

and E|{x,y: x~y}| = o{e 2 V 2 ) as we did in 13.611 . The result now follows by Markov's inequality and 
Lemma l377l □ 

Lemma 3.9 (Most pairs have almost independent disjoint survival probabilities). LetG m be a sequence 
of transitive finite graph with degree m for which 0.51) holds and m — > oo. Putp = p c (l + e). Then,for 
any jxjy - £ _1 loglog£ 3 y such that j x >jy » £~ l > there exist at least (1 - o(l))V 2 pairs of vertices x,y 
such that 

P(^(x,y,; x ,; y )) = (l + o(l))4£ 2 . 

Proof. The upper bound P(,e/ (x, y, j x , j x )) < (1 + o(l))4e 2 follows immediately from Lemmas 13.21 and 
13 .61 and is valid for all pairs x,y. We turn to showing the corresponding lower bound. First note that 
Cauchy- Schwartz's inequality gives E[iV|] > (EA/2) 2 which can be written as 

£P(dB x (r) ? and dB y {r) ? 0) > V 2 Y{dB{r) ? 0) 2 . 

x,y 

We take r = e -1 logloge 3 V. Since P(d£ x (j) ^ 0) is decreasing in j, by Lemma [3T6l and our assumption 
on j x and j y we get 

J^ndBAjx) * and dByijy) ?0)> (4 - o{\))V 2 e 2 . 

x,y 

Secondly, Corollary l3 .51 implies that 

£P(x~y) = VE[\B(2r)\] < CVr(l + e) 2r = o(e 2 V 2 ), 
x,y 

by our choice of r and since e 3 V — ► 00. Since 

^(x,y,j x ,j y ) Q {dB x {j x ) ? 0, dByijy) ^ 0} \ {x y}, 

we deduce that 

£ P(^(x, y, j x , ; y )) > (4 - o(l))e 2 V 2 , 

x,y 

and since the upper bound was valid for all x, y, the lemma follows. □ 

Lemma 3.10. Let G m be a sequence of transitive finite graph with degree m for which fi.5D holds and 
m — ► 00. Put p = p c {\ + e). Then for any r < e~ l log log e 3 V such thatr » e~ l we have 



{x,y: srf{x,y,r,r) and|<#(x)| < (£ 3 I/) 1/4 £" 2 } 
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Proof. By Lemma l377l the assertion follows from the statement that 

|{x: dB x (r) ^ and |<#(x)| < (e 3 V) 1,4 e~ 2 }\/(eV) 0. 

To show this, note that 



V{dB x {r) = and \<g(x)\ > (eWV 2 ) < P[\B x {r)\ > (eWV 2 ) < 3 ^ 1/4g _ 2 = o(e) . 
Hence, by Theorem l2.21 

P(dB x (r) 7^ and |^(x)| > {e 3 V) ll4 e~ 2 ) > (2-o(l))e. 
Together with Lemma lXHl this yields that 

V{dB x {r) * and |<#(x)| < (rWV 2 ) = o{e), 
concluding our proof. □ 

3.4. Using the non-backtracking random walk. In the rest of this section we provide several basic 
percolation estimates which we use throughout the paper. These include bounds on long and short 
connection probabilities and bounds on various triangle and square diagrams. It is here that we make 
crucial use of the geometry of the graph and the behavior of the random walk on it, namely, the as- 
sumptions of Theorem ll.3[ We frequently use non-backtracking random walk estimates. This walk is 
a simple random walk on a graph that is not allowed to traverse back on an edge it has just walked on. 
Let us first define it formally. 

The non-backtracking random walk on an undirected graph G = (V, E), starting from a vertex x e V, 
is a Markov chain {X t } with transition matrix P x on the state space of directed edges 

E = j(x,y): {x,y} ee\. 

If X t = (x, y), then we write X™ = x and Xf = y. Also, for notational convenience, we write 

P ix , w] (-) = *P x {-\X = (x,w)), and p t (x,y) = P x [Xf ] = y). 
The non-backtracking walk starting from a vertex x has initial state given by 

P* ( x oM ^ ) = W3i _i_, 

and transition probabilities given by 

X 1 

P ( M ,,)^1 - (f. ">)) - 1 {iv , w]£ -E, w ^ v]deg[v) _ l ' 

where we write deg(x) for the degree of x in G. The following lemma will be useful. 
Lemma 3.11. Let G be a regular graph of degree m. Then, for t>0, 

£ p 1 (0, u)p l ( u, z) = p f+1 (0, z) + — ^-p'" 1 (0, z), 
u m-1 

with the convention thatp~ l {G,z) = 0. 



Proof. The claim follows by conditioning on whether the first step of the NBW counted in p f (u, z) is 
equal to ( u, 0) or not. We omit further details. □ 
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3.5. Uniform upper bounds on long connection probabilities. In this section we show that long per- 
colation paths are asymptotically equally likely to end at any vertex in G. 

Lemma 3.12. Let Gbea graph satisfying the assumptions in Theorem lL^ and consider percolation on 
it with p < p c ( 1 + e) . Then, for any in teger t> m and any vertex x, 

P p (0 <— x) + P p (0 — x) < E\dB(t-m )\. 

Proof. If *^* x, then there exists a vertex v and simple path co of length mo from x to v such that the 
event 

{a) is open} o { v < * x} , 

occurs. Indeed, consider a shortest path rj of length fbetweenOto x. Take v = r){m ) andoi = r\[\,mo\. 

Now, the first witness is the path oj and the witness for { v < 1 m ° x} is the path r\ [ra , t] together with all 
the closed edges in G p (which determine that 77 [m , t] is a shortest path). These are disjoint witnesses. 

If x occurs, then we get the same conclusion with to of length Wo + 1. We now apply the BK- 
Reimer inequality and the fact that the probability that w is open is precisely . This yields 

Pp(0~x)<p m °£ X P P (" — x), 

v w. \a)\=mo 
(o[mo]=v 

and 

Pp(0^x)<p m ° +1 £ L Ppiv^x). 

v ay: \(o\ = mo + l 
(o[mo+l]=v 

We now bound these above by relaxing the requirement that co is simple and only require that it is 
non-backtracking. Since m = T m ^{a m ), we get by definition that 

\{o>: \a)\ = m ,co[m ] = v}\ Ucd: \<o\ = m + l,a)[m + l] = v}\ mn nin+lt ^ 2 + 2a m 

1 : 1 + 1 1 = p" 10 0, V) + P m ° + L 0, V) < , 

m {m- d^o-i m(m-l) w ° K F V 

where we enumerated only non-backtracking paths in the above. Using this, condition (2) and sum- 
ming over v gives 

Pp(0^x)+P p (0^x) < 2 + ^ am) [p(m-l)] mo E\dB(t-m )\ 

2 + O(a m + em ) 
< E\dB(t-m )\, 

concluding our proof. □ 

Lemma 3.13. Let Gbea graph satisfying the assumptions in Theorem lL^ and consider percolation on 
it with p < p c {l + e). Then, for any r > mo and any vertex x, 

P p (0 < ► xj < E\B{r)\ . 

Proof. The proof is similar to that of Lemma l3.121 If the event occurs, then there exists a vertex v and 
a simple path a) of length mo from to v such that the event 

{a) is open}o {v x}, 



occurs. Hence, 

P[m ,r 



Pp(0^x)<^£ £ Pptv^x), 



v cu: \(i>\ = mo 
(o[rno]=v 
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by the BK inequality. By the same argument, 

v w: |w| = mo + l 
(i)\mo+\] = v 

The reason we make two such similar estimates is that due to possible periodicity, in each of the 
estimates the sum over v may include only half the vertices of V and not all of them. We now take the 
average of these two estimates, sum over v to get the E|B(r)| factor and use the same analysis as in 
Lemma l3.12l using condition (2) . This gives the required assertion of the lemma. □ 

Lemma 3.14. Let Ghea graph satisfying the assumptions in Theorem ] 1.3\ and consider percolation on 
it with p< p c (l + e). Then, for any r>m and any vertex x, 

\r,2r] 1 + 0{a m + em ) 
P(0 U-J X ) < v -2 -E\B([r - m ,2r - m DI . 

Proof. This follows by summing the estimate of Lemma l3.12l and using the fact that 

E\dB(t)\ < p(m-l)E\dB(t-l)\< (1 + ra" 1 + e)E|d£(£- 1)|, 

where the last inequality is due to condition (2) and the first inequality holds since, given dB{t - 1), 
\dB(t)\ is stochastically bounded above by a sum of \dB(t - 1)| binomial random variables with pa- 
rameters m— 1 and p. The lemma follows since a m > m~ l . □ 

We close this section with a remark which will become useful later. We often would like to have 
these uniform connection bounds off some subsets of vertices. The proofs in the lemmas of this 
sections immediately generalize to such a setting. This is because the claim "the number of paths 
from to v of length mo is at most n" still holds even if we are in G\ A, for any subset of vertices A. We 
state the required assertion here and omit their proofs: 

Lemma 3.15 (Uniform connection bounds off sets). Consider percolation on G = {0, \} m with p = 
p c {\ + e), and let A he any subset of vertices. Then, for any r > mo and any vertex x, 

P offj4 (0~x) < {2 + O(a m + em Q ))V- 1 E\dB(r-m ;A)\, 

P ffA(0 — x) < {l + 0{a m + emo))V- l E\B{r;A)\. 

3.6. Proof of part (a) of Theorem 1 1.31 We demonstrate the use of Lemma l3.13l by showing that the 
finite triangle condition holds under the assumptions of Theorem ll.3[ We begin with an easy calcula- 
tion. 

Claim 3.16. On any regular graph G of degree m and any vertices x, y, 

£ £ p tl (x,u)p t2 (u,v)p t3 (v,y) = l {x=y] + 0(m~ l ) , 

u,v r 1) f 2 ,f3:f 1 +f 2 +f 3 e{0,l,2} 

and 

£ £ p ti (x,u)p t2 (u,v)p t Hv,y) = 0(m- 1 ). 

u,v ti,t 2 ,h:ti+t 2 +hE{\,2} 

Proof. We prove both statements simultaneously. The contribution coming from t\ + t 2 + t 3 = is the 
one we get when x = u = v = y giving l\ x =y}- The contributions coming from 1 1 + £2 + h = 1 can only 
come from the cases u = v = y and [t\, £2, £3) = (1,0,0), or u= v = xand [ti, h, £3) = (0,0, 1), or u = xand 
v = y and (£1, £2- W = (0, 1,0). These are easily bounded using the fact that max z p f (w,z) < l/(m-l) for 
any t > 1. We perform a similar case analysis to bound the contributions of £1 + 1 2 + £3 = 2. If (£1, £2, £3) = 
(0,0,2), then we must have u= v = xandp 2 (f,y) = 0(m _1 ), this argument also handles the case where 
one of the other f/s is 2. In the case (£1, £ 2 , £3) = (1, 1,0) we must have that v = y and that u is a neighbor 
both of x and y. There are at most m such u's and for each we have that p : (x, u)\) l {u,y) = 0[m~ 2 ). 
The case (£1, £ 2 , £3) e {(0, 1, 1), (1,0, 1)} are handled similarly. □ 
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Proof of part (a) of Theorem ll.3l Let p < p c . If one of the connections in the sum V p (x, y) is of length 
in [m ,oo), say between x and u, then we may estimate 

^ [m„oo] (1 + 0(1))E P |TO)|„ 

U,V " U,V 

(l + o(l))(Ep|^(0)|) 3 
V 

where we used Lemma l3.13l (and took r —* oo in both sides of the lemma) for the first inequality. Thus, 
we are only left to deal with short connections, 



V p (x,y) < £ P p (x ~ M )Pp( M ~ i/)P p (i/ ~ y) + OiV^xip) 3 ) . 



We write 



too m ° =£1 

Pp(x «— a) = £ Pp(x — i I/) , 

and do the same for all three terms so that 

mo _ _ _ 

Vp(x,y)<£ £ PpCx^i^PpCu^^PpCy^^ + OCy- 1 ^^) 3 ). (3.7) 

U,V tl,t2, t3 

We bound 

P p (x~ u) < m(m-l) fl_1 p fl (x, u)p h , 

simply because m[m - l) fl_1 p fl (x, u) is an upper bound on the number of simple paths of length t\ 
starting at x and ending at u. Hence 

m 3 m ° 

Vp(x, y) < £ £ ^ m " 1)] fl + ' 2+ V 1 (x, u)p f2 (u, f/)p t3 (i/, y) + OiV^xip) 3 ) . 

\ m iJ U,Vti,t 2 ,t 3 

Since p < p c , assumption (2) gives that [p(m - 1)] fl+f2+f3 = 1 + 0{a m ), and condition (3) together with 
Claim|3lE6]yields that 

Vp(x,y) < l {x=y} + 0(V-^(p) 3 ) + Oim' 1 + a m /log V) , 
concluding the proof. □ 

3.7. Extended triangle and square diagrams. In this section, we provide several extensions to the 
triangle condition 11.51 . We will bound the triangle diagram in the supercritical phase (which requires 
bounding the length of connections, otherwise the sums blow up) and estimate a square diagram 
which will be useful in a key second moment calculation in Section^ 

Lemma 3.17 (Short supercritical triangles). Let G be a graph satisfying the assumptions in Theorem 
\1.3\ and consider percolation on it with p < p c (l + e). Then, 

max £ Pp(x «-£ u)V p {u •>— • v)V p {v ~ y) = 0{a m + emo) . 

x ' y u,v. {m,i/}tM0,0} 

Proof. This follows immediately by assumptions (2) and (3) of Theorem l 1.31 and the usual bound 

Pp(x ~ u) < p s m{m - l) s_1 p s (x, u) , 
as we did before. □ 
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Corollary 3.18 (Long supercritical triangles). Let Gbea graph satisfying the assumptions in Theorem 
\1.3\ and consider percolation on it with p < p c {\ + e). Let r\, r 2 , r 3 be integers that are all at least mo. 
Then, 



max 

X ' y U,V. {u,v}?{0,0} 



£ P(x~ u)P(u^ v)P{v £*y) 



(3 + 0{a m + emn)) 

< O(a m + em ) + — E|B(n)|E|5(r 2 )|E|S(r 3 )| . (3.8) 

Proof. We split the sum into two cases. The first case is that at least one of the connection events 

occurs with a path of leng 
bound, uniformly in x, y, 



occurs with a path of length at least mo. For instance, if p ,^ m °' n } u occurs, then we use Lemma l3TT3l to 



Z P[m ,ri] r 2 r 3 

P(X <- — u)P{u <— ► v)P[v <— >• y) 

u,v 



(l + O(a m + em )) r r 2 r 3 . 

E\B{n)\2^V{u-+ v)P(v^y) 



V 

{l + O(a m + em )) 



E|B(r 1 )|E|B(r 2 )|E|B(r 3 )|. 



V 

The second case is when all the connections occur with paths of length at most mo, in which case we 
use Lemma lHTTTl and get a 0{a m + em ) bound. This concludes the proof. □ 

Lemma 3.19 (Supercritical square diagram). Let G be a graph satisfying the assumptions in Theorem 
\1.3\ and consider percolation on it with p< p c {\ + e). Let r\, r 2 be both at least m . Then, 

£ P(zi ~ ui)P(zi ~ u 2 )P(z 2 ~ u[)P{z2 ~ u' 2 ) < Cm 2 (E\B(r 1 )\) 2 (E\B(r 2 )\) 2 + CVm 2 moa m 

{U\,u! l },(U2,u' 2 ),Z\,Z2 

Proof. See Figured] If one of the connections is of length at least m , then we use Lemma l3.13l and the 

summation simplifies. For instance, if U\ \ m °' n } z\, then we use Lemma l3.13l and sum over z\, followed 
by sums over u\ and u 2 . This gives a bound of 

Cm 2 (E\B{r±)\) 2 ^ r 2 , r, , ? ? ? 
/ £ P(z 2 ^u[)P(z 2 ^u' 2 )<Cm 2 [E\B(r 1 )\ 2 E\B(r 2 )\] 2 , 

U V U 2 ,Z2 

where C > 1 is an upper bound on 1 + 0[a m + emo). 
We are left to bound the sum 

£ P(zi » «i)P(zi S u 2 )P(z 2 OS, ui)P(z 2 a u' 2 ) (3.9) 

(M1,m' 1 ),(W2,"2' ,Zi,Z2 

= V 2^ P(0<— Mi)P(0^- u 2 )P{z 2 <— u\)P{z 2 «- * u 2 ), 

(Ml,Mj),(K2,M 2 ),Z2 
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by transitivity. We write this sum as VY. U ' 2 fi.u' 2 )g[u' 2 ), where 

g(u' 2 )= £ P(0^U!)P(z 2 — u[mz2~ u' 2 ), f{u' 2 )= £ P(0^U 2 ). (3.10) 

(Ul,w' 1 ),Z2 U2~u' 2 

We then bound 

^E/("2)g("2) * ^(E/("2))(maxg(^)) (3.11) 

U f 2 ll' 2 U 2 

= VmE|B(m )|max Y P(0 ~ i/i)P(z 2 ~ u',)P(z 2 ~ x). 

X , M 

By condition (2) in Theorem ll.31 we can write the above as 

^E/^2)g r ("2)^Cym 2 E|B(m )|max £ £ p fl (0, Mjp^m, u^K^P^,*) (3.12) 

ll' 2 U\ , Mj ,Z2 f l ' . f3 SO 

<CVm 2 E\B(m )\max £ E p' 1 Ui)[p f2+1 (wi,z 2 ) + — -p t2 ~ 1 (u 1 ,z 2 )]p t3 (z 2 ,x) 

X ui,z 2 fi,f 2 ,f 3 >0 m 1 

< Cym 2 E|S(m )|(a m + 0(l/m)) < CVm 2 E\B{m )\a m , 

where we use Lemma 13.111 in the second inequality, and Claim 13.161 condition (2) in Theorem 11.31 
and a m > 1/ra in the final inequality. Further, E|B(rao)| = O(rao) by Corollary [33] and the fact that 
mo = o(e _1 ). This concludes our proof. □ 

4. Volume estimates 

In this section, we study the expected volume of intrinsic balls and their boundaries at various radii 
in both the critical and supercritical phase. 

4. 1. In the critical regime. Given a subset of vertices A and integer r > we write 

G(r;A) = E\dB(r;A)\, G(r) = max G{r;A). 

AQV(G) 

Theorem 4.1 (Expected boundary size). Let G be a graph satisfying the assumptions ofTheorem \1.3\ 
and consider percolation on it with p = p c . Then there exists a constant C > such that for any integer 
r, 

G(r) < C. 

Proof. Define F(r) = E|5(r)| and F{r;A) = E\B{r;A)\, so that F[r; A) < F{r) for all subsets A. Define 
G* (r) = max ssr G(s), and let r > 2m be a maximizer of G* , that is, r is such that G(r') < G(r) for any 
r' < r. Let A = A{r) be the subset of vertices which maximizes G(r; A) so that G(r; A) = G(r) = G* (r). 
Given such r and A = A(r) we will prove that there exists c> such that for any integer s>0, 

F(r + s;A)>cG*{r)F(s;A). (4.1) 

We begin by bounding 

F{r + s; A) > £P off A (0 — v). 

For a vertex a we define c € u {0; A) = {x : «-»• x off A u {a}}. Now, for any vertex v, if there exists v 

such that <— * 1/ off A and u <->■ v off ^"(0; A), then ^ — ► 1/ off A. Furthermore, if such u exists, then 
it is unique because otherwise v e c € u {0; A). We deduce that 

F(r + s;A) > £ P off A (0 ~ a and {u A v off <# M (0;A)}) . 
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We now condition on c to u {0;A) = H for some admissible H (that is, for which the probability of the 
event ^"(0; A) = H is positive, and in which u occurs). In this conditioning, we also condition on 
the status of all edges touching H. Note that by definition A n H = 0. We can write the right hand side 
of the last inequality as 

E E PoBAi e # u iO;A) = H)P BAiu~vo&H), 

in the same way as we derived 13.11 . This can be rewritten as 

£ E P ffA( c g u (0;A) = H)[P offA (u~v)-P oiiA [u~von\yonH)}. 

v ^ u H: 0<-»u off A 

We open the parenthesis and have that the first part of this sum equals precisely G* (r)F(s; A) since r 
and A were maximizers. We need to show that the second part of the sum is of lower order. To that 
aim, if u v only on H, then there exists h e H such that h ^ u and {u <->■ h] o {h <->• i/}. By the BK 
inequality, we bound the second part of the sum above by 



EE E P o ffA(^ u (0;A) = H)P offA (u^h)P onA {h^v). 

11 H:0~uOffA heH ' h * u ' v 

Summing over v and changing the order of the summation gives that the last sum is at most 

F{s;A) E P o ffA(0~u,0~fc)P O ff,i(M~fc). 

We bound this from above using Lemma l331 by 

F(s;A) E E p off^(0^^) max P(z~ uoff AuD)P oSA {z~ h)P oSA {u± h). (4.2) 

We sum this separately for t < r - ra and f e [r - m , r] . For t< r- ra , we bound for any DcV 

«,=»•-* ™ 3G(r-f-m ;AuD) 3G*(r) 
P(z — u off A u D) < < 



V V 

where the first inequality is by Lemma l3.15l and the second by definition of G* (r). Hence, the sum 
over t < r - m in 114.21 1 is at most 



3G*{r)F(s;A) ? s 3G*(r)F(s; A)(E|^(0)|) 3 _ l3 _ v 

2^ P o ffA(0 — z)P offj 4(z — /z)P offA (/z~ m)< = 3A J G (r)F(s;A), 



1/ V 1/ 

where the inequality we got by summing over u, h and z (in that order) and the equality is due to the 
definition of p c in 11.11 . Our A = 1/10 is chosen small enough so that 3A 3 < 1/2. 
We now bound the sum in 14.21 1 for t e [r - mo, r] . We first bound 

=t 3G*(r) 
Poff^(0 — z)<— — , 

as we did before using Lemma l3TT5l and pull that term out of the sum. This gives an upper bound of 

3G* {r)F(s m A) m ° —s s 

V Y max P{z^- u off A u D)P ofi A {z^h)P off A {u^*h). 



We would like to sum the first term in the sum over si and get a contribution of P(z *^ u). We cannot 
do that however, because the maximizing set D may depend on s\ so these are not necessarily disjoint 
events. Instead we bound for all D Q V(G) 

P(z~ uoff AuD) < m{m-\) Sl p s c l v Sl {z, u) < (1 + o(l))p Sl (z, u) 

where the first inequality is since m{m- l) Sl p Sl (z, u) bounds the number of simple paths of length Si 
connecting z to u and the second inequality is due to condition (2) of Theorem 1 1.31 Now, if one of 
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the connections z *-> h or u h is in fact a connection of length at least mo we use Lemma 15 .131 to 

simplify the sum. For instance, if the connection is z \ m °' 00 } h, then we bound the probability of this 
by 2 V~ l E\^ (0)| and the sum simplifies to 

— 2 2^ L P (z,u)P(u~ h), 

V u^h,zsi=0 



we then sum over h, u, z and S\ and get a contribution of 

4G*(r)F(s;A)m (E|^(0)|) 2 



o(G*(r)F(s;A)), 



V 

since (E|^(0)|) 2 = 0{V 213 ) and ra = o(e _1 ) = o{V 113 ). Thus, it remains to bound 



3G*(r)F(s;A) ™° ™ ™ 
L L P Hz.MjPoff^U"— ^)P ff h). 

u^h,zs\=0 



We bound this by 



CG* ir)F{s' A] m ° 

v L E p Sl U,u)p S2 (w^)p S3 (^2) = o(l)-G*(r)F(5;A), 

where we used Claim l3TT6l and condition (3) of Theorem ll.3[ This concludes the proof of 14.11 . 

We now turn to prove the main result assuming 14.11 1 . First, for any r < 2 mo we have that the number 
of non-backtracking paths emanating from is at most m(m - l) r_1 and hence, for any A, 

G{r;A) < m(m- l) r ~ l p' c = 1 + o(l) , 

by condition (2) of Theorem 1 1.31 It remains to consider the case r > 2m . Assume by contradiction 
that there exists some r > 2m such that r is the maximizer in the definition G* (r) and that G* (r) > 21c 
where c is the constant from 14.lt . Fix such r and let A = A{r) be the maximizing set as in 14.lt . Now, 
putting 5 = r in 14.lt gives 

F(2r; A) > cG* (r)F(r; A) > 2F(r; A) . 

Putting 5 = 2r in 14.lt gives 

F{3r;A) > cG* (r)F{2r;A) > AF(r;A) , 

and so, by induction, for any k, 

F(kr;A)>2 k - 1 F(r;A). 

We have reached a contradiction, since on the right hand side we have a quantity going to oo in k 
(note that A cannot contain 0, otherwise it will not be maximizing, so F(r ; A) > 1) and on the left hand 
side our quantity is bounded by V. □ 

We now wish to obtain the reverse inequality to Theorem l4.11 that is, a lower bound to E|dB(r)|. Of 
course, this cannot hold for all r but it turns out to hold as long as r « V 113 . This is the correct upper 
bound on r because the diameter of critical clusters is of order V 113 (see 1461 ). 

Lemma 4.2 (Lower bound on critical expected ball). LetG be any transitive finite graph and put p = p c 
where p c is defined in U.4t . Then, there exists a constant f > such that for allr < £V 113 , 

E\B{r)\ > r/4. 

Proof. For convenience write c = 1/4. Assume by contradiction that E|6(r)| < cr. Given this assump- 
tion, we will prove by induction that for any integer k > 0, 

E|B([r(l + jfc/2),r(l + (fc+l)/2)])| <2 k+1 c k+2 r. (4.3) 



HYPERCUBE PERCOLATION 27 

Let us begin with the case k = 0. Since E|B(r)| < cr there exists r' £ [r/2, r] such that E\dB(r')\ < 2c. 
Hence, 

E|5([r,3r/2])| = J^P{B(r') = A)E[\B([r,3r/2])\\B(r') = A] 

A 

= £P(fl(r') = A)E[ £ E\B a &rl2-r';A)\] 

A aedA 

< crE\dB{r')\<2c 2 r, 

where the inequality follows since E|B fl (3r/2 - r'\ A)\ < E|B(r)| < cr by monotonicity and for any A. 
Assume now that (14 .3D holds for some k and we prove it for k+ 1. Since it holds for k, we have that 
there exists r'e [r(l + fc/2),r(l + (A:+l)/2)] such that E|dB(r')| < 2 k+2 c k+2 . By conditioning on B(r') = A 
as before we get that 

E|B([r(l + (jfc + l)/2),r(l + (jfc + 2)/2)])| < cr -2 k+2 c k+2 , 

concluding the proof 14.31 . Now, since c < 1/2 it is clear that the sum over k of 114.311 is at most Cr, 
contradicting the fact that E Pc 1^(0)1 = XV 113 by our definition of p c in 11.41 . Note that the constant % 
may depend on A. □ 

Lemma 4.3 (Lower bound on expected boundary size). Let G be a transitive finite graph for which 
fi.5D holds and put p = p c . Then there exists constants c, £ > such that for any r <^V 113 , 

E\dB{r)\ > c. 

Proof. By Lemma l4~2l and Theorem l2.1l we have that E|B([2r, Cr])| > r for some large fixed C > 0. Also, 
E\B(Cr)\ 2 <£P(0^x,0^y) < ^ P(0-z)P(z-i)P(z-y)<Cr 3 , 

x,y x,y,z 

by Theorem l2.1i By the inequality 

PCX > a) > (EX - a) 2 /EX 2 (4.4) 
valid for any non-negative random variable X and a < EX, with a = 0, 

V{dB{2r)^0)>clr, 

for some c> 0. Furthermore, given B{r), each vertex of 3B(r) has probability at most Cr" 1 of reaching 
dB(2r) by Theorem l2.11 Hence, for any ( > 0, 

P(dB(2r) and |d5(r)| < (r) <C(/r. 

We now have 

P(|d£(r)| > (r) > V{dB{2r) ? 0) - P(dB(2r) ^ and |d£(r)| < (r) 
r r 

and the lemma follows by choosing ( > small enough. □ 

4.2. In the supercritical regime. In this section, we extend the volume estimates to the supercritical 
regime. The following is an immediate corollary ofTheorem l4.lt 

Lemma 4.4 (Upper bound on supercritical volume). Let G be a graph satisfying the assumptions in 
Theorem lL^ and consider percolation on it with p< p c [l + e). Then for any r and any Ac V we have 

E\dB(t;A)\ < C(l + e) f , and E\B(r;A)\ < C£ _1 (l + ef . 

Proof. The first assertion is immediate by Theorem l4.1l and Lemma l3~4l The second assertion follows 
by summing the first over t < r. □ 
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The corresponding lower bound is more complicated to obtain, and as before, can only hold up to 
some value of r. In conjunction with Lemma l4~4l it identifies E|B(r)| = Q(£r _1 (l + e) r ) for appropriate 
r's. 

Theorem 4.5 (Lower bound on supercritical volume). Let G be a graph satisfying the assumptions in 
Theorem \1.3\ and consider percolation on it with p = p c {\ + e). Then for any r satisfying 



e 2 V 

(logins 



mir)\< — j— 4 , (4.5) 



the following bound holds: 

E\B(r)\ > C£ _1 (l + e) r . 

Proof. First, we may assume that 

r <£ _1 log(£ 3 y), (4.6) 

since otherwise the assumption of the lemma cannot hold together with the conclusion. Recall now 
the simultaneous coupling (described at the end of Section HJ] between percolation at p\ = p c and 
P2 = p c (l + £). Let 

s&iix) = {0 <— * x in G P2 }, 
and given a simple path r] of length £ between and x, write 

srfg{x,r]) = {0 «— * x in G P2 and n is the lexicographical first p 2 -open path between and x} , 

so that srfeix) = \} n sd£{x,r(). Write £§g(x,ri) for the event that the edges of n are in fact pi-open (not 
just p 2 ). We have that 

=£ 

ste{x,r\) r\£$£{x,r}) c {0 <— * x in G pi }, 

so, 

|J s^e{x,rj) n &{{x,rj) s {0 l > r ~ £ x in G Pl }. (4.7) 

£e[r-e~ 1 ,r],ri 



We will show that 



£p( t-^xinG Pl \ y ^(x,i7)n5l/(jc,i7)) = o(£" 1 ), 

x ^e[r-e -1 ,r],jj 



(4.8) 



and first complete the proof subject to J4.8II . Since {s^({x,r])r\^£{x,r])}f >T1 are disjoint events, 114.711 and 
114.811 show that 

£ PGfi//(jc,i7) n SB e {x,rj)) > E Pl |B([r - e" 1 , r])| - o(£ _1 ) > ce -1 , 

x,^6[r-e _1 ,r],7) 

where the last inequality used Lemma l4~3l and the fact that r « V 1/3 by 114.6H and e » V~ 1/3 . From this 
the required result follows since 

P(»/(x,i7) I ij)) = (l + er e , 
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which implies that 

E P2 |B(r)|> £ PW/(x,»7)) (4.9) 
£ P(^(x,?7))P(^(x,r/) |^(jc,77))(l + e/ 

x,^e[r-e _1 ,r],77 

>(l + £) r -^ 1 £ P(^U,77)n^U,7/)) 
> (l + e) r-e ~ 1 ce~ :l . 

Thus, our main effort is to show 14.81 1 under the restriction of 114.511 and 14.61) . Fix x and assume that 
the event 

{0 ' r ~ g 1,r ' xinG Pl }\ |J s4£(x,Tj)n38i(x,Tj), (4.10) 
ee[r-e~ 1 ,r],ri 

occurs. In words, this event means that either the shortest ^2-open path is shorter than the shortest 
pi-path or that they have the same length but the lexicographically first shortest £>2-path contains 
an edge having value in [p\, p 2 ]. This implies that the /?2-path shortcuts the /?i-path. Formally, given 
vertices u, v and integers £ e [r - e _1 , r], k e [0,/], t e [2,£] with k+ t < £ write 3~{u, v,x,£,k, t) for the 
event that there exists paths 1/1,772,^3,7 in the graph such that 

(1) rji is a shortest pi-open path of length k connecting to u, 

(2) 772 is a shortest pi-open path of length t connecting u to v, 

(3) 773 is a shortest pi-open path of length £ - t-k connecting v to x, 

(4) B x {£-k-t)nB o (k) = 0mG pi , 

(5) 7 is £>2-open path of length at most t connecting u to v and one of the edges of 7 receives value 
in [pi,p 2 ], and 

(6) 771,772,773 and 7 are disjoint paths, 

see Figure |U The event 14.101 1 implies that 3~{u,v, x, £, k, t) occurs for some u, v, £, k, t satisfying the 
conditions above. Our treatment of the case t > mo is easier than the case t < mo so let us perform 
this first. When t > m we forget about condition (4) and the special edge with value [p\,p2\ in (5) and 
take a union over £, k and t e [m , r] of the event 3~{u, v, x, £, k, t). This union implies the existence of 
vertices u, v such that the following events occur disjointly: 

(1) ^* u in G pi , 

, n1 Plm ,r] . 

(2) u 1 ► f in G pi , 

(3) f ^ x in G pi , 

(4) v in G P2 . 

Indeed, the witnesses to these (monotone) events are the paths 771,772,773,7. We now wish to use the 
BK inequality, however, as the astute reader may have already noticed, our witnesses are not stated 
in an i.i.d. product measure. Let us expand briefly on how we may still use the BK inequality. We 
may consider our simultaneous coupling measure to be an i.i.d. product measure by putting on each 
edge a countable infinite sequence of independent random bits receiving with probability 1/2 and 1 
otherwise such that this sequence encodes the uniform [0, 1] random variable attached to each edge. 
In this setting, a witness for an edge being p-open is the sequence of bits attached to the edge and 
similarly for the edge being p-closed. Similarly, we define this way events of the form "E\ in G pi 
occurs disjointly from E 2 in G pz ". With this definition of witnesses we may use the BK inequality here 
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to bound the probability of the union above and sum over x (as in H4.8I I. This gives an upper bound of 

£ P Pl (0 ~ k)P Pi (u £^ v)P pi (v ~ x)V P2 {u ~ v) . 

u,v,x 

We sum over x and get a factor of r by Theorem 12. li We bound P Pl (w f [m °' r } v ) < CrV~ l by Lemma 
13. 131 and Theorem l2.ll We then sum over v and get a factor of E p2 \B(r)\ and on u to get another factor 
of r. All together this gives an upper bound of 



Cr 3 E„ 2 \B(r)\ 



-Pz 



O^log-V^^oCe" 1 ), 



V 

by g3}. 

We now treat the case in which t e [2, m ]. We claim that the event 3~{u, v,x,£, k, t) implies that 
there exist disjoint paths 772,7 between u and v such that \r) 2 \ = t and |y| < t and the intersection of 
the following events occurs: 

(a) 7)2 is pi-open, 

(b) 7 is p 2 -open, and one of its edges receives value in \p\,p2\, 

=k —£—k—t 

(c) <— <■ u off 772 u 7 and ^ ► x off r] 2 u 7 u Bo(^) in G pi . 

Indeed, let 7/1,772,773,7 be the disjoint paths guaranteed to exists in the definition of 3~{u, v,x,£, k, t). 
The paths 772 and 7 show that both (a) and (b) indeed occur (note that we have relaxed the requirement 
that 772 is a shortest pi-open path). Seeing that (c) occurs is more subtle. First observe that for any 
two vertices z, y and integer £ > 

{z^yoffA}= [J ({/Ms open} f] has a closed edge}] , 

y6:|/3|=^,/3nA=0 P':\P'\<£,l5'nA=0 

where /3, /3' are simple paths in G and we slightly abuse notation and write /3 n A = to denote that 
the edges of /3 are disjoint from the edges touching A. To see that (c) holds we note that the event 
3~{u,v,x,£,k, t) implies that 773 is of length A: between and u, disjoint from 772U7, is pi-openand 

any shorter path between and u has a p\ -closed edge in it; in particular, •>— ► u off 772 u 7 occurs in 
G Pl . Similarly 773 is of length £ -k-t between v and x, is disjoint from 772 u 7 u B {k), is pi-open and 

any shorter 
occurs in G r 



any shorter path between v and x has a pi -closed edge in it; in particular v *^ ► x off 772 u 7 u B (k) 



'Pi- 

Now, the events (a), (b), (c) are independent since they are measurable with respect to disjoint sets 
of edges (the edges of 772, 7 and all the rest). The probability of their intersection is hence 

21 p l J l [ 1 - (pi lp z ) lrl ] P P1 (0 ^ u off 772 u 7 and v x off 772 u 7 u B (fc)) , 
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where the factor [l-(/?i/p2)' rl ] is the probability that one edge of 7 has value in [p\,p2\ conditioned on 
all edges being p 2 -open. We compute the probability on the right hand side as usual by conditioning 
on B {k), this gives 

p l ? 2l p l J l [l-(pi/p 2 ) lrl ] E P pi (Bo(fc) = A)P pi (v^^xoffAuj72Ur). 

A:0~u 

We now start summing all this over u, v, x, £, k, t, rj 2 ,Y. We start by summing over x the last probability, 
giving as a constant factor by Theorem l4.11 The sum over A gives a term of P pi (0 <— * u off 772 u 7) which 
we sum over k e [0, r] and bound this by P P1 (0 <-> u). Furthermore, the number of possible 772 's is at 
most m{m - l) f p f (u, v) and if \j\ = s < t, then the number of such 7's is at most m{m - \) s ~ l p s {u, v). 
We also bound [1 - {p\ I p-£) s \ < Cse. All this gives that 

E P{3~{u,v,x,£,k,t))<Ce E (m-l) s+f p{p|sp f (w, i/)p s (u, i/)P pi (0 ~ u). 

w, y,x,^,fc,fe[2,mo] w, y,^ 

fE[2,mol>se[l,f] 

By condition (2) of Theorem l 1.3 l and the fact that mo = o(e _1 ), we have that {m-l) s+t p[p s 2 = l + o{a m ), 
so we may bound this sum by 

C£EP Pl (0~ u) E sp\u,v)p s {u,v). 

i/,fe[2,mo],se[l,fl 

The sum over £ e[r- e" 1 , r] gives a factor of e _1 , and since G is transitive, the second sum over v, t, s 
does not depend on u. Hence we may sum over u separately using Theorem l2.1l giving a bound of 

Cr E sp\u, v)p s {u, v) . 

v,te [2,m ],se[l,t] 

For each 5 > 1 and s\ e {1, . . . , s}, we can bound 

p s (0, v) < T p Sl (0, w)p s ~ Sl (w,v), 

m-lV 

because the number of non-backtracking paths of length 5 from to v is at most the sum over w the 
number of non-backtracking paths of length s\ from to if times the number of non-backtracking 
paths of length s-S\ from w to v (the factor ml (m - 1) comes from properly normalizing these num- 
bers). As a result, 

ffl C(X 

£ 5 p f (0, v)p s {Q, !,)<—-£ £ p t {0,v)p s '(0,w)p S2 (w,v)<— f 

u,te[2,m ],s£[l,t] m 1 v, w fe[2,m ],sie[l,f],s 2 <si i°g ^ 

by condition (3) in Theorem l 1.31 and the fact that t + s\ + s 2 ^ 3. All together we get that 

Crcc 

E P(^"(K,i/ l x,/ l fc,f))<- — ^ = o(e _1 ), 

u,v,x,£,k,t£[2,mo] % 

by our assumption 1 14.61 1 and since a TO = o(l). This finishes the proof of 1 14.81 1 and concludes the proof 
of the theorem. □ 

The following are easy corollaries: 

Corollary 4.6. Let Gbea graph satisfying the assumptions in Theorem lL^ and consider percolation on 
it with p = p c (l + e). Then for any r satisfying 

r<e~ l [ log(£ 3 10-4 loglog(e 3 V) ] , 

the following bound holds 

E|£(r)| = e(£- 1 (l + e) r ). 
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In particular for r defined in (2.8\ ) 

E\B(r )\ = &(^a m eV). 

Proof. The upper bound follows from Lemma l4~4l and the lower bound from Theorem l4.5l □ 

Lemma 4.7. Let Gbea graph satisfying the assumptions in Theorem ] 1.3\ and consider percolation on it 
with p < p c {\ + e). Let r be an integer satisfying the assumptions ofTheorem \4.$, Then, 

E\B{r)\ 2 < Ce _1 (E|B(r)|) 2 . 
Proof. If ^* x and ^ y, then there exists a vertex z and an integer t < r such that the event 

—t r—t r — t 

{0 <— > Z\ o {z *— ► X} o {z — * y} . 

Apply BK-Reimer and sum over x, y and then z to bound 

E|5(r)| 2 < £ E\dB{t)\E\B{r - t)\E\B(r - t)\ . 

We apply Lemma lOl and Theorem l4.1l to bound 

E|B(r)| 2 <C£" 3 (l + e) 2r , 

and Theorem l4.5l gives the required claim. □ 

5. An intrinsic-metric regularity theorem 

For an increasing event E and a vertex a, we say that a is pivotal for E for the event that E occurs 
but does not occur in the modified configuration in which we close all the edges touching a. We write 
Piv(ij) for the set of pivotal vertices for the event E. For vertices a, x, radii ri,j x and A c V, we define 

G rujx (a,x;A) = E[\{u: a f [2m °' ri , 1 u ff ^\ {a} and a e Piv({jc — u})}| | B x {j x ) = A] . 

Definition 5.1 (Regenerative and fit vertices), (a) Given vertices x, a, radii r±,j x > m and a real num- 
ber f> > 0, we say that a is (/3, j x , r{j -regenerative if 

— j x 

(1) x *— * a, and 

(2) G,- Ux {a,x;B x (jx)) > (l-j8)E|fl(n)|, 

and note that this event is determined by the status of the edges touching B x {j x ). We say that a is 

=j x 

(/3, j x , r^-nonregenerative ifx <— * a but it is not [ft, j x , ri) -regenerative, 
(b) Given an additional real number 6 > 0, we say thatx is [5, f>, j x , r^-fit if 

(1) dB x {j x ) ^ holds and, 

(2) the number o/(/3, j x , r{) -nonregenerative vertices is at mostde' 1 . 
It will also be convenient to combine our error terms. For this, we define 

(o m = a^ + emo, (5.1) 

so that co m = o(l). Our goal in this section is to prove that if dB x {j x ) ^ 0, then x is fit with high proba- 
bility. This is the intrinsic metric regularity theorem discussed in Section l2T5l 

Theorem 5.1 (Intrinsic regularity). Let G be a graph satisfying the assumptions ofTheorem \1.3l Let 
p = p c {l + e), let r = r m = M/e where M = M m is defined in (2.7V and ri e [e _1 , r ], where r is defined 
in (2.80 . For any 6, fie (0,1) there exist at least (1 - 0{o}\l^))r radii j x e [r,2r] suchthat 

P(xis (5,/8 f j x ,n)-fit) > [l-oid^p-^o^vmxUx) * 0). 

We start by proving some preparatory lemmas: 
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Lemma 5.2. Assume the setting of Theorem \5.1[ Then, 

2r 

£ XP(^^^«^ ! ^"offB x (; x )\{a})>(l-0(w w1 ))E|B([r,2r])|E|B(r 1 )|. 

jx = ra,u 

Proof. We condition on B x {j x ) = A for any admissible A (that is, A for which the event x*-* a occurs 
and PCB x (j x ) = A) >0). Then, 

P(fl M off \ {a} | = A) = P(a u off A \ {a}) , 

and 

P[2m ,ri] P[2mo,ri] P[2m ,ri] ... .. 

P(a i — u off A \ {a}) = P(a <- ► u) - P(a < ► u only on A \ {a}) . 

Summing over the first term gives 

£ P(x^a)P(fl^^i/)>E|B([r,2r])|(E|B(ri)|-E|B(2mo)|) 

a,u,i x £[r,2r] 

= a-0(w m ))E|fl(ri)|E|B([r,2r])|, 

since E|5(2m )| < Cmo by Corollary 13.51 and E|B(ri)| > ce~ l by Theorem 14.51 and since r\ > e~ l . It 
remains to bound the sum 

£ ^P(Bx(;'x) = A)P(a^^ a only on A \ {a}). 

a,u,j x e[r,2r] A 

As usual, if a < _^2^ t . M on iy on A \ {a} occurs, then there exists ze A such that {a *-^> z} o {z u}. BK 
inequality now gives 

£ £P(B X (; X ) = A) £ P(a^z)PU-a). (5.2) 

a,u,j x e[r,2r] A zeA\{a] 

We sum over w and extract a factor of E|B(ri)|. We then change the order of summation, so the sum 
simplifies to 

E|B(ri)| £ P(x-a,i-z)P(fl-z). 

a,z^fl,7 x e[r,2r] 

We sum over j x (noting that the events x <— * a, x «- >• z are disjoint as y x varies) and bound this sum by 

E\B(n)\ E PU— a,x^z)P(a^z). 

As usual, ifx^a and jc <-^» z, then there exists z' such that the event 

i 2r /. . i 2r / 2r 

{X — <• Z } o {z <— - Z] o {z 

occurs. By the BK inequality we bound the above sum by 

E|B(n)| £ P(^z')P(z'^z)P(z'^fl)P(fl-z)- 

We may now sum over a and z ^ a using Corollary l3 . 1 8l and then sum over z' to get that this is bounded 

by 

E|B(n)|(E|5(2r)|) 2 



CE|B(n)|E|fl(2r)| 



dim + ■ 



V 

This concludes our proof since the second term in the parenthesis is of order at most a},( 2 e 4M (e 3 V)~ ] 12 

.1/2 



a)j? by the upper bound on r 1; our choice of r and M in 1 12.71 1 and Corollary l4.61 □ 
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Lemma 5.3. Assume the setting ofTheorem \5.1\ There exists aC>0, such that 

£ J^P{x = Jia,a p J^l u ftB x {j x ) \ {a}, a e Piv({x 3 ~ u})) > (l - 0(a) m ))E\B([r,2r])\E\B( ri )\. 

j x=r a,u 

Proof. Fix j x e [r,2r]. We rely on Lemma [5721 and bound the difference in the probabilities appearing 
in Lemma lBTSl and the one above. If the event 

{x *^ a, a P[2m ° ,n } u ff B x (j x ) \ {a}} (5.3) 

occurs but a £ Piv({x ^— u}), then there exist z\,Z2 and t < j x and paths 7/1,772,71,72-73 such that 

(a) 71 is an open path of length at most r\ connecting a to z 2 , 

(b) 72 is an open path of length at most r\ connecting z 2 to u, 

(c) 73 is an open path of length at most r\ connecting z\ to Z2, 

(d) 771 is a shortest open path of length precisely t connecting x to z\, 

(e) 772 is a shortest open path of length precisely j x - t connecting Z\ to a, 

(f) 71-72. T3> r li>il2 are disjoint. 

See Figure |3j Indeed, assume that a is not pivotal for x ^— ^ u and J5.3H holds. Let 77 be the first 
lexicographical shortest open path of length j x between x and a and 7 a disjoint open path of length 
in [2m , between a and u off B x {j x ) \ {a} which we are guaranteed to have since H5.3H holds. Since a 
is not pivotal we learn that there exists another open path /3 between x and u of length at most j x + r\ 
that does no? visit a. Hence, /3 goes "around" a, or in formal words, there exists vertices Z\ and Z2 on /3 
appearing on it in that order such that Z\ e 77 and Z2 e 7 and the part of 77 between z\ and Z2 is disjoint 
from 77 u 7. We take t < j x to be such that 77(f) = z\ and put 771 = 77 [0, ^,772 = r][t,j x ]. We take 73 to 
be that section of /3 between z\ and Z2 and 71,72 be the sections of 7 from a to z 2 and from z 2 to u, 
respectively. 

For all j x , t e [r,2r] these events (that is, the existence of Z\,z 2 and the disjoint paths) are disjoint 
since 771 and 772 are required to be shortest open paths. The union of these events over j x , t implies 
that there exists Z\, z 2 such that 

{x 2-* zi) o [zi ^ a} o {a z 2 } ° {z x ^* z 2 } o {z 2 ~ u} , 
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since we can just take 771,772,71,72*73 as our disjoint witnesses. Using BK inequality we bound the 
required sum from above by 

X P(x ~ zi)P(zi ~ a)P(a ~ z 2 )P(zi — z 2 )P(z 2 ~ u) . 

a,u,z\,Z2 : 

Summing first over u extracts a factor of E|B(ri)|, and we sum over a and Z2 using Corollary l3.18[ and 
lastly sum over Z\ . This gives a bound of 



CE|B(ri)|E|B(2r)| 



(E|B(n)|) 2 E|B(2r) 



V 

We apply Lemma I5T21 to conclude the proof since the second term in the parenthesis is of order at 
most a m e 2M < a^ 2 by the upper bound on r\, our choice of r and M in 112.71 ) and Corollary l4.61 Also 
note that E\B([r,2r)\ > E|fi(2r)|/2 by CorollarySHand our choice of r and M. □ 

Lemma 5.4. Assume the setting ofTheorem \5.1\ For any vertices x, a, 

£ £p(jc ~ a, a— u off B x (j x ) \ {a}) < (1 + 0{o) m W~ l E\B(n)\E\B{[r,2r])\. 

jx = r u 

Proof. The event x*-^ a, a p , [2mQ ' n } u ff B x (j x ) \ {a} implies that 

{x^- a}o{a^ u], 

the second witness is the open edges of an open path of length in [2m , r{\ off B x {j x ) \ {a} and the first 
witness is the lexicographically first shortest open path of length j x between x and a together with all 
the closed edges of the graph. The BK-Reimer inequality gives that 

P(x<-^ a, a P[2m ° ,ri } uofiB x (j x ) \{a}) <P(x<-^ a)V{a-^ u). 
We sum over u and j x e [r, 2r] get that the sum is bounded by 

Lemma l3T4l gives that 



E|B(n)|P(Jc^a). 



P(x ^ a) < (1 + 0{(o m )) V^ElBdr - m ,2r - m ])| . 

We have that 

E|B([r-mo,2r-m ])|<E|B([r,2r])| + E|B([r-mo,r])|<(l + 0(£m ))E|B([r,2r])| 

since E|B([r - m , r]) \ < Cm (l + e) r by Theoreml4~T1and Corollary[331and since E\B([r,2r]) > ce~ l {\ + 
e) 2r by Theorem l4.5l and Lemma l4~4l (we use the assumption that r » e -1 ). Hence 

P(x^ a) < a + 0{a) m ))V- l Emir,2r])\, (5.4) 
concluding our proof. □ 

Proof of Theorem l5.ll By combining Lemmas l531 and l5~4l we deduce that for any x there exist at least 
(1 - 0(<w^ 2 )) V vertices a such that 

2r _ . 

£ £p(jc ~ a, u ofi B x {j x ) \ {a},ae Piv({x — u})) = (1 + O^ 2 )) V^EmnmBUr^r])] . 

Write G for the variable 

2r 

G = E G A ., ri (a,x;B x (7 x ))l =jx 

Jx— 1 
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Note that G is a random variable that is measurable with respect to B x {2r) (that is, it is determined 

[r 2r] 

by the status of the edges touching B x {2r)) and that it equals unless x ^— ■* a. Furthermore, only 
one of the summands can be nonzero because the events in the indicators are disjoint. Our previous 
inequality can be rewritten as 

EG = (1 + 0(w^ 2 ))y- 1 E|B(r 1 )|E|B([r,2r])| . 
Hence, for at least (1 - 0{d)]^)) V vertices a, 

E[G|x^a] > (l-0(ct^ 2 ))E|B(ri)|, (5.5) 

by Lemma l3.14l This gives the conditional first moment estimate. The second moment calculation is 
somewhat easier. We have 

EG 2 = £ E E[p(a^^Hi off B x (j x ) \ {a} \B x [j x ))P{a??^ u 2 ottB x [j x )\{a}\B x U x ))l =j x . 

j x = rui,u 2 \x—a\ 

We bound, almost surely in B x (j x ) and for i = 1,2, 

p (a P[2mo^ Ui of £ Bxijx) \ {a} | Bxijx)] < P(a JX Ui ), 
and sum over u\ and u 2 to get that 

EG 2 l { ^ fl) < [E|B(n)|] 2 P(x^a), 

so that 



E[G 2 |x^fl]<[E|5(n)|] 2 . 
Combining this with 115.51 1, we obtain 

Var(G| x™ a) = 0([E|B(ri)|] 2 ^ 2 ) . 
By Chebychev's inequality, for any /3 > 0, 

P(G < (1 - P)E\B(n)\ \xWa) = Oip- 2 ^ 2 ) . 
Recall that this holds for at least (1 - 0(w^ 2 )) V vertices a. Call these vertices valid. We have 

£ p(x~ a,G< (l-j3)E|3(n)|) = O(E|5([r,2r])| ) 0- 2 w^ 2 ), 
a valid 

by our previous estimate. Also, since there are at most Oico 1 ^ 2 V) invalid a's, we apply 115.41 1 to bound 
the sum over all a by 

£p(x ™a,G<{\- j8)E|fl(n)|) = 0(E|B([r,2r])|/r 2 ft^ 2 ) . 

Returning to our original notation, we rewrite this as 

2r . _ . 

E E p ( x ^ fl '^^i^^ B x(;x))<(l-i3)E|B(r 1 )|) = 0(E|B([r,2r])| ) 6- 2 w^ 2 ). 

Hence, there are at least (1 - 0(wj,( 4 ))r radii j x e [r,2r] such that 

Y,P(x~a,G jxi rMx;B x (j x ))<a-m\Bin)\\ = 0(E|B([r,2r])|r-^- 2 w^ 4 ) 

a 

- (J(e p £o m J, 

where the last inequality is by Lemma l4~4l Given such j x , write X{j x ) for the random variable 

X{jx) = \{a: x^a,G jx>n {a,x;B x Ux)) ^ d - pW(n) |}| , 
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so that EX[j x ) < Ce p o*m • The variable X{j x ) equals the number of {/3, j x , r\) -non-regenerative 
vertices. By Markov's inequality we get that for any 8 > 

P(X(j x ) > 5e- 1 ) = 0{e8- l p- 2 e 2M ^) , 

and we conclude by Lemma l3.6l that at least (1 - 0(<w^ 4 ))r radii j x e [r,2r] satisfy 

P(dB x (j x ) and X(j x ) < de' 1 ) > (l - 0{8~ l p~ 2 e 2M ^^{dBAj^ ? 0) , 

as required. □ 

6. Large clusters are close 

In this section, we prove Theorem 12.41 which shows that many closed edges exist between most 
large clusters. This section involves all our notation from the previous sections and in order to ease 
the readability we include a legend in terms of V, m, m ,e 

m>^m which are given in Theorem [TT3] in 

Table 1 at the end of the paper. We define p, k, £, (, 8 as 

P=(\ogM)~ 2 , k=-^—, £ = (logM] 1, \ C = UogMT 1/8 , 8 = (12. (6.1) 

logM 

For notational convenience we also denote 

P[2m ,r ],x P[2m ,r ] j x + r 

{a< — ->■ u} = {a^ — — -»• u} n {a e Piv(x ■>— - u)}, 

P[2m ,r ],y , P[2m ,r Q ] f , n . , h + r o , 

{b <- ► u } = {b « — u}n{be Piv(y ^— ■* u )} . 

Let Sj x j ro (x, y) be the random variable counting the number of edges {u, u') such that there exist 
vertices a, b with 

(1) ^{x,y,j x ,j y ), 

(2) x *^ a and y «-* b and 

P[2m ,r ],x 

(3) a < ► u and 

P[2m ,r ],y , . 

(4) fr< w'offB x (; x + r ). 

Further define 

S 2 r,2r,r (*, y) = u') : {x — o {y — M '} , \B U (2r + r )| ■ \B u i (2r + r ) | > e 40A V 2 (E|B(r ) |) 2 } | . 
We will use the fact that for any j x , j y e{r,...,2r} 2 

S 2 r+r (X, y) > S j x>jy>ro {X, y) - S 2r ,2r,r [x, y) , (6.2) 

where S2 r+ro (x, y) is the random variable defined above Theorem l2.41 Finally write si (x, y, j x , j y , r , p, k) 
for the intersection of the events 

(1) £^{x,y,j x ,j y ), 

(2) \B x (jx)\ < e- 2 (l + e) 3r and \B y [j y )\ < e~ 2 (l + e) 3r and \dB x {j x )\ < e^l + e) 3r , 

(3) \dB x {j x )\ > e^V 1 and |dfl y (j y )l > e kl4 £-\ 

(4) xis (1, )6, 7' x ,r ) -fit and y is [l,p,j y , r )-fit, 
(5) 



E 



^V-^e-^EIBfro)!)^^ 2 . 



This event is measurable with respect to B x {j x ),B y {j y ). The following three statements will prove 
Theorem[23J 

Lemma 6.1. Assume the setting ofTheorem \2.4\ Then, 

E|{x,y: s4{x,y,2r,2r) and S 2r ,2r, rQ (x,y) > p ll2 V- 1 me- 2 (E\B(r )\) 2 }\ = o{e 2 V 2 ). 
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Theorem 6.2. Assume the setting ofTheorem \2.4\ Then there exists radii ji, . . ., jg e [r,2r] such that for 
at least (1 - o(l)) V 2 pairs x, y, 

p[s*{x,y,2r,2r) and f) *t{x,y,j x ,j y , r ,j8, A;) c ) = o(e 2 ) . 

Theorem 6.3. Assume the setting of Theorem \2.4\ and let x,y be a pair of vertices. Then, for any radii 
j x ,j y e{r,...,2r} 2 , 

P{S jx>jy , ro {x,y)<2fi ll2 V- 1 m£- z m\B(r )\f and si {x, y, j x ,j y , r , (5, k)) = 0{/3 ll2 e 2 ). 

Proof of Theorem l2.4l subject to Lemma lBTTl and Theorems l6.2l - l6T3l Lemma lBTBl shows that 

\{x,y: sd{x,y,2r,2r)\\ P 

4e 2 V 2 ~^ ' 

Thus, it suffices to prove that 

|{x,y: stf{x,y,2r,2r) and x,y are not (r, r ) -good} | P 

~^V 2 > °' 

Lemma l3. 101 shows that 

\{x,y:*f(x,y,2r,2r) and|<#(x)| < (e 3 !/) 174 ^ 2 or \^(y)\ < {c 3 V) ll4 e- 2 }\ P 

'e 2 V 2 >0 ' 
We are left to handle requirement (3) in the definition of (r, r )-good. Let ji, . . . , jg be the radii guar- 
anteed to exist by Theorem l6.2l and let x, y be a pair of vertices for which the assertion of Theorem l6.2l 
holds. Theorem l6.2l asserts that the number of such pairs is (l-o(l))V 2 so the sumofP(.2/(x,y,2r,2r)) 
over pairs not counted is o(e 2 V 2 ). Write /(x),/(y) for the lexicographically first pair ij x ,jy) £ Ui,---,je} 2 
for which the event sd (x, y, j x , j y , r , £>, k) occurs, or put J{x) = J{y) = oo if no such j x , j y exist. Theorem 
I6.2l states that for at least (1 - o(l)) V 2 pairs x, y 

P{sJ(x,y,2r,2r),Kx) = oo,/(y) =oo) = o(e 2 ). 

Theorem l6.3l together with the union bound implies that for any such pair x, y 

L P{S jx , jy , ro (x,y)<2l3 ll2 V- 1 me- 2 (E\B(r )\) 2 ,J(x) = j x ,J(y) = j y ) = 0{/5 1,2 £ 2 e 2 ) , 

]'x,jy£{jl,-,j(} Z 

which is o{e 2 ) by our choice of € and f> in H6.1IL By these last two statements we deduce that 

E|{x,y: ^(x,y,2r,2r) and Vj x ,j y S jx>jyjQ {x,y) < 2p ll2 V- l m £ - 2 (E\B{r )\) 2 }\ = o(e 2 V 2 ). 
This together with 116.211 and Lemma lBTTl implies that 

E|{x,y: ^(x,y,2r,2r)and S 2r+ ,- (x,y) < f> ll2 V~ l me- 2 {E\B{r Q )\) 2 }\ = o{e 2 V 2 ) , 
concluding our proof since P 112 = (logMT 1 . □ 

6.1. Proof of LemmaED Bounding the error S2r,2r,r - m this section, we prove Lemma lBTTl We begin 
by providing some useful estimates. 

Lemma 6.4. Assume the setting of Theorem \2.4\ and let p = p c [l + e). There exists C > such that for 
any positive integer n 

(1) £ P[{u^x}o{ u ' ^y} and u^u')<C[me~ 5 (1 + e) 4n + a m Vme~ 2 [l + e) 2n }. 

x,y,(u,u') 

(2) £ P({M^x}o{u'^y}andx^-y)<C[me" 5 (l + £) 4w + a m Vme" 2 (l + £) 2w ]. 

x,y,(u,u') 
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Proof. We begin by showing (1). If {u «— <■ x} o {u' «-» y] and u <— u', then there exists vertices z\,z 2 and 
integers t\,t 2 <n such that the event 

-h , =t 2 j , /, , 2n-t\-t 2 , , n-t\ . . n-t 2 
{U^Z\,U <— Z 2 ,d Gp {u,u) >h+ t 2 }o{Zi « ► Z 2 }o{x< — *Zi}o{y< — ► z 2 }, 

or the event 

r =t\ i =t 2 , , u , i+i r 2n-t\-t 2 . . n-h , ( n-t 2 , 
{u<— zi,u *-* z 2 ,d Gp {u,u) >h + t 2 }o{ Zl < > Z 2 }o{x< — *Z 2 }o{y< — ► z X } , 

occur. See Figured] (a). Indeed, let 77 be the shortest open path of length at most 2 n between u and u' 
and let Y x ,u>Ty, u ' be two disjoint paths of length at most n connecting x to u and y to u', respectively. 
We take z\,z 2 to be the first vertices of y x , M andy y , M ' which belongs to r\. There are two possible order- 
ings of z\, z 2 on 77, that is, {u,Z\,z 2 , u') or {u, z 2 , z\, u'), which give the two possible events. Assume the 
ordering on 77 is {u, Z\,z 2 , u') (the two orderings give rise to identical contributions to the sum in (1)), 
and put fx, t 2 to be the distances on r\ between u and Z\ and between z 2 and u', respectively and write 
7/1,7/2 to be the corresponding sections of r\ and 773 is the section of 77 between z\ and z 2 . The paths 71 
and 72 are the sections of j x>u and y y M ' from x to Z\ and from y to z 2 , respectively. The witness for the 
first event is 77 1 , rj 2 together with all the closed edges of G p (the closed edges determine that 77 1 , r\ 2 are 
indeed shortest open paths, and that d,Q v {u, u') > t\ + t 2 ), for the second, third and fourth events, the 
witnesses are just 773, 71 and 72, respectively. 

We now apply the BK-Reimer inequality and bound the sum in (1) by 

_ v-^ — h 1 — tl 1 , /■, nt., 2n— 1\ — t 2 n—t\ n— 1 2 , 

2 2^ p (" <-> z i> u *-* z 2> «G P iu, u') > fi + f 2 )PCzi < - >z 2 )V{x< — >Zi)V{y< — > z 2 ) . 

x,y,z\,z 2 ,{u,u'),ti<n,t 2 <n 

We first sum over x, y and get a factor of Ce~ 2 (l + e ) 2 "- f i- f 2 D y Lemma l4~4l The event u •>— ► Z\, u' «-2 
Z2, dc p {u, u') > t\ + t 2 implies that u z\ and u' *-+ z 2 off B u {t\) hence we may bound its probability 
by 

£ PiBuih) = Amu' ~z 2 off A), 

A: u*-~>z\ 

and so we get an upper bound of 

Ce~ 2 V (1 + e) 2w " fl " f2 P(u ^ Zi)maxP(^ ~ z 2 off A)P{z 1 2 , n ~ h ~^ Z2 ) . ( 6 .3) 
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We bound this in two parts. If f 2 > m , then we use Lemma l3 . 1 5l together with Lemma l4~4l to bound, 
uniformly in A, P(z/ *-* z 2 off A) < CV _1 (1 + e) t2 . We then sum over z 2 and Z\ in that order using 
Lemma l4~4l and extract a Vm factor from summing over (u, u'). If £ 2 ^ mo and ti > mo, then we use 
Lemma l3.13l together with Lemma l4~4l to bound Pf u z\ ) < CV _1 (l + £) fl - Further, we use condition 

(2) in Theorem [L3] and e = o^m^ 1 ) to bound, uniformly in A, P{u' *-* Z2 off A) < Cp f2 (z/,z 2 ). We then 
sum over z\ and z 2 in that order using Lemma l4~4l and extract aVm factor from summing over (u, u'). 
All this gives an upper bound of 

Cme- 3 a + e) 4n £ (l + £)" tr ' 2 <Cffl£" 5 (l + £) 4 ", 

fl,f 2 <« 



as required. We next sum 116.31 1 over £1 , f 2 < mo- We first relax (1 + e) 2n fl f2 < (1 + e) 2n and P(zi 
z 2 ) < P(zi «— >■ z 2 ), and then sum over f 1( f 2 to get an upper bound of 

Ce~ 2 a + e) 2n £ p (" ~ z ^ ~ ^)P(zi ~ z 2 ) . 

Zl,Z2,iu,u') 

We now sum over Z\, z 2 using Corollary l3.18l and Lemma l4~4l We get that this is bounded by 

; C[me~ 5 {l + ef n + a m Vme~ 2 (l + e) 2n ] , 



2n-t\-t2 



CVme~ 2 {l + e) 2n 



m 2 e- 1 (l + e) 2n 



V 



+ a, 



since m <e \ as required. 

To bound (2) we proceed in a very similar fashion. If {u *— - x} o {u' — <• y} and x — ► y then there exists 
vertices Z\,z 2 and t\, t 2 < n such that the event 

r =fl =£2 t / w . , . , , 2n-h-t 2 , . «-fi , . ,«-f 2 , 
{*«— Zl,J/ «— Z 2 ,dG p U,y) " f l + r 2}°{Zi < - Z 2 }o{i/< ► Zi}o{u «— z 2 }, 



or the event 



= fi =f 2 , 2n-fi-f 2 «-fi , n-f 2 

{x<— z 2 ,y <— zi,d Gp (*,y) > ?i + f 2 }o{zi < ► z 2 }o{u* — ► zi}o{ u « — ► z 2 }, 



occur, by the same reasoning as before, see Figure|4] (b). Let us handle the first case only (the second 
leads to an identical contribution) . We appeal to the BK-Reimer inequality and as before we condition 
on B x {ti) and bound 

P{x^z h y^z 2 ,d Gp {x,y)>t 1 + t 2 )< £ P(fl x (*i) = A)P{y ^ z 2 off A). 

A:x->—>z\ 

We sum over y then x using Lemma lPl giving a bound of 

C £ (1 + £ ) fl+f2 P(u 2=£ zxJPUx z 2 )P(u' ^ z 2 ) . 

zi,z 2 ,(u,w'),fi,f 2 £w 

An appeal to Corollary l3 . 1 81 and Lemma lOl to sum over z\, z 2 gives a bound of 

r£ -3Q + £ -)4n-2(ti + f2) , 

CVm d + £) fl+f2 + u m <C[mE~ 5 (l + e) 4n + a m Vme~ 2 a + e) 2n ], 

h,h<n V 
where the last inequality is a direct calculation. □ 

Proof of Lemma l6Tfl For convenience put n = 2r + r . By Markov's inequality, the expectation we 
need to bound is at most 

4j8- 1/2 Vm- 1 e z (E|B(ro)|)~ 2 £ P({x<-^ u}o{y A u'},\B u {n)\ > e 20M e _1 E|B(r )|) . (6.4) 

x,y,(w,«') 
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We split the sum into 

Si= £ P{{x~u}o{yJlu'}AB u (n)\>e 2OM e- 1 E\B(r o )\andB u (n)nB u ,(n) = 0), 

x,y,{u, u') 

and 

S2 = ^ P({x *— * l/} o {y u'} , U *—* l/) . 

x,y,(u, u 1 ) 

We bound S\ using the BK inequality 

Si< £ P(xA M ,|B M ( n )|> e 20M £ - 1 E|B(r )|)P(yA M '). 

x,y,(w,w') 

Summing over y and x and then over (w, u') gives that this is at most 

Vrr^\B{n)\-E\B{n)\\ mm ^^^ nBm} \. 
We use the Cauchy- Schwartz inequality to bound 

E|B(n)|l {|B(w)| > e 20M £ -i E|B(ro)|! | < [E\B{n)\ 2 P(\B(n)\ > e 20M e^Em^)} 112 . 
We bound this using Lemma l4~7l and the Markov inequality by 

E|B(n)|l {|B(w)|ae2 oM,-i E | B(ro) | } |<Ce- 10M (E|S(n)|) 3/2 (E|B(r )|)- 1/2 , 

and conclude that 

Si < Ce" 10M ym(E|B(n)|) 5/2 (E|B(r )|)" 1/2 . 
We bound S 2 using part (1) of Lemma l6.4l by 

S 2 < C[me~ 5 (l + c) 4n + a m Vme~ 2 (l + e) 2n ] . 

We put these two back into 116.41 1 and get that we can bound this sum by 

CV 2 e 2 (E\B(n)\) 512 CVe- 3 (l + ef n Cma m V 2 {\ + e) 2n , , 



/3i/2 e iOM(E|B(r )|) 5/2 )6 1/2 (E|B(ro)|) 2 p 1,2 m(E\B{r Q )\) 2 
by our choice of r in 112.81 1 . n=ro + 2r,r = M/e, /3 = (logM) -2 and using Corollary l4.61 □ 

6.2. Proof of Theorem l6.2t Finding good radii. We proceed towards the proof of Theorem l6.21 Recall 
the choice of parameters in 16.H . 

Lemma 6.5. For any radius r > e~ l and any ( > 0, 

v(\dB(r)\ > and 3j e [e -1 , r - e" 1 ] with \dB(j)\ < fe" 1 ) < 0((e) . 

Proof. Assume that the event holds, and let / be the first radius j with j e [e~ l ,r - e~ l ] which has 
\dB{j)\ <(£ -1 . Conditioned on /and for 1 35 (r) | > to occur, one of the vertices on the boundary 
of B{J) needs to reach level r. Since r - j > e~ l , Corollary [33] and the union bound gives that this 
probability is at most C(. This together with the fact that the probability of \dB{j) | > is at most Ce, 
by Corollary l3.51 concludes the proof. □ 

In the lemma below, we write P^(0 = P(- off A \ B x {j x ) = A) and let E^ be the corresponding expec- 
tation. 

Lemma 6.6. There exists c > such that for any radius j x e [r,2r] the following statement holds. Let 
the set A be such thatx is (6,j3,j x , ke' 1 ) -fit and \dB x (j x ) \ > (e -1 when B x (j x ) = A. Then, 

P A {\ dB *Ux + fce _1 /2)| > e^e*' 4 ) > cf . 
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Proof. We perform a second moment argument on \B x {[j x + ke~ l l2,j x + ke~ l ]\ rather than on the 
required random variable. Since xis {8,(3, j x , Ax -1 ) -fit 

E A \B x 0jx,jx + ke~ l ])\ > (\dA\ - 6e- l )(l - /3)E| S ( A:^" 1 ) | . 

Furthermore, 

EA\B x {[j x ,j x + ke- 1 l2])\ < \dA\E\B{ke- l l2)\, 

by monotonicity. Since \dA\ > 28e~ 1 by our choice of ( and 8, and f> = o(l) (recall 1 16.11 1), Corollary l4.6l 
now gives us a lower bound on the first moment 

^A\B x ([j x + ke~ 1 /2,j x + he' 1 ])] > ^dAlEmke' 1 )] , 

To calculate the second moment, if u, v are counted in \B([j x , j x + Are" 1 ]) |, then either there exists two 
vertices a\, a 2 in d A such that 

{a\ < — ► u off A} o {a 2 < — * v off A} , 
or there exists a e dA, a vertex z and t < Are" 1 such that 

— ^ JcE^ t IcE^ t 

{a<-* z off A} o {z • u off A} o {z < y off A} . 
We apply the BK-Reimer inequality and sum over u, v. We get 

E A \B x ([j x ,j x + ks- 1 ])\ 2 <\dA\ 2 (E\B(ke- 1 )\f+ £ P A (a ~ z off A)(E|B(fc £ - 1 - t)\f . 

aedA,z,t<k£- 1 

We first sum over z using Lemma l4~4l then appeal again to Corollary l4.6l to get that 

E A \B x {[j x ,jx + ke~ l ])\ z < C(E|B(A:£- 1 )|) 2 [|aA| 2 + \dA\e~ 1 } . 

By S3), 

Va{\B x {[] x + ke~ 1 /2,j x + ke~ l ])\ > \\dA\E\B[.ke- l )\) > ^ 1 > cf , 

where the last inequality is since \dA\ > . By Theorem l4.51 we can write this as 

P A {\B x {[j x + ke- 1 /2,j x + ke- l ])\ > c(e- 2 e k ) > cf, (6.5) 

for some constant c> 0. Now, if \B x {[j x + ke~ 1 /2,j x + Ax _1 ])| > c(e~ 2 e k and |55 x (;' x + Ax _1 /2)| < £r _1 e fc/4 
occurs, then 

\dB x {j x + ke~ l l2)\<e~ l e kli and £ IB^/oT 1 ^; A)| > c(e~ 2 e k , 

v£dB x {j x +k£- l /2) 

must both occur. By the Markov inequality and Lemma l4~4l the probability of this event is at most 



2 Q 3fc/4 

= 0{C 1 e- lc " l ) = o(0, 



£ e — l„-fe/4 



c(e 2 e k 

by our choice of ( and k in 1 16.11 1 . Putting this together with [16.51 1 yields the assertion of the lemma. □ 
Lemma 6.7 (Finding good radii). There exists radii k\,...,kp in[r,2r] suchthat 

ki+i ~ ki> kc 1 , 

for all i = !,...,£ and 

P(xis(.8,p,ki,ke- l )-m) = (l + O((x) 1 l l 5 )mdB x (k i )^0), 
P(xis(l, ) 6,A: ; -,ro)-fit) = (1 + OiaffimdBM * 0) , 
P(xis(.8,p,ki + k£- l /2,r )-fLt) = a + O{a) l J l 5 ))P{dB x (k i + ke~ l /2)^0). 
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Proof. This is the only place where we use Theorem [5JJ Indeed, say a radius j e [r,2r] is good if it 
satisfies the three assertions of the proposition with fcj replaced by Three appeals to Theorem l5.ll 
give that at least (l-o(l))r radii j e [r,2r] are good by our choice of 6 and f>. Now, since £ k = o(M) and 
r = Me -1 it is immediate that there exist £ good radii which are ke~ l separated from each other. □ 

Lemma 6.8. For at least (1 - o(l)) V 2 pairs x, y and for any j x , j y e [r,2r], 



E 



S My,r l {x ^r y] 



<V- l m(E\B(r )\fa 3 ' 4 . 



Proof. Part (2) of Lemma l6.4l with n = ro + 2r and a straightforward calculation with Theorem l4.5l and 
our choice of parameters shows that 

£E[S; W - ro l 2r 0+jx+jy }<CVm{E\B{r Q )\) 2 [a m e 8M + a m e 4M ], 

x,y < — > y} 

which gives the result since Ca m e 8M < a^ 4 by our choice of M in H2.7D . □ 

Proof ofTheorem l6.2l Recall the requirements (1) - (5) in the definition of srf [x, y, j x , j y , ro, /3, k) . We ap- 
ply Lemma lBTTl and let ki, . . . , kg be the corresponding radii. We prove the theorem with radii {ji,...,j(} 
defined by 

; ; - = ki + ke~ l l2, 

for i = !,...,£ and assume x, y are such that the assertion of Lemma EE holds. We will prove that for 
these pairs x, y 

p\si{x,y,2r,2r) and Q {(q) does not hold for j x >jy\\ = o{e 2 ) , (6.6) 

jx>jy£ijl,-,je} 

for q g {1,2,3,4,5}. We do this in the order (1), (2), (4), (5) and (3). Since sd{x,y,2r,2r) c sd{x,y, j x , j y ) 
when j x ,j y <2r, 1 16.61) holds trivially for q = 1 and all x,y, j x ,j y <2r. 
For any j x e {ji,...,j £ }, 

P{^(x,y,j x ,j y ) and \B x (j x )\ > <T 2 (1 + e) 3r ) < Ce 2 {\ + e)~ r = 0{e~ M e 2 ), 

by the Markov inequality, Lemma l4~4l the BK-Reimer inequality and Corollary l3.5[ This implies that 

P{^(x,y,2r,2r) and 3j x e {ji,...,j ( } such that \B x (j x )\ > e~ 2 {\ + ef) = o(e 2 ), 

since £ = o(e M ). Similarly, 

P{^(x,y,j x , j y ) and \dB x (j x )\ > e^U + e) 3r ) < Ce 2 {l + eY r = 0(e" M e 2 ), 

leading to the same bound. This proves 1 16.61 1 for q = 2. 
Next, we wish to show that for any j x e{ji,..., j e }, 

P{^(x,y,j x ,j y ) and x is not (l,/3,; x , r )-fit) = 0{e 2 oj]^). (6.7) 

It is tempting to use the BK-Reimer inequality here, however, we cannot claim that the event in 1 16.71 1 
implies that dB y {j y ) ^ occurs disjointly from the event x is not {l,fi,j x , r )-fit, since they are both 
non-monotone events and the corresponding witnesses may share closed edges. Instead, we condi- 
tion B x {j x ) = A and get that 

P(^(x, y, j x , j y ) and x is not (1,0, j x , r )-fit) = £ P(B*(;J = A)P(3B y (; y ) ^ off A) , 

A: x is not (l,^j x ,r )-fit 

since (1, /3, j x , r )-fit is determined by the status of the edges touching B x {j x ). We use Corollary l3.5l to 
bound P{dBy{jy) ± off A) = 0{e) and 

P{dBxijx) * and x is not (1, f>, j x , r )-fit) = P(dB x (j x ) ? 0) - P(dB x (j x ) * and x is (1, f>, j x , r )-fit) 

<P(dB x (j x )^0)O(u) 1 J l 5 ), 
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by our choice of radii in Lemma l677l so Corollary l3.5l gives I I6.7I I. Therefore, 

P(,e/(x,y,2r,2r) and3; x e such that x is not (l,p,j x ,r )-Gt) = 0(^e 2 w^ 5 ) = o(e 2 ), 

by our choice of £ in (16. X D . of M in 112.71 1, and of o) m in 15.11 . This proves 1 16.61 1 for g = 4. 
Similarly by Lemma lBTBl and Markov's inequality for any j x , j y , 

P(.saf (x, y, j x , j y ) and (5) does not hold) < Ce 2 a^ 4 . 

The union bound implies that 

P(«k/(x, y,2r,2r) and 3j x , j y e ...,;>} (5) does nothold) = 0(/ 2 a^V) = o(e 2 ), 

by our choice of £. Therefore, 116.61 1 holds for g = 5. 

Thus, it remains to prove 1 16.61 1 for q = 3. This is the difficult requirement in which we only prove 
that one of the radii in {jx, . . ., jg} satisfies it (in fact, all radii satisfy it, but that is harder to prove, and 
we refrain from doing so). In the same way as in the proof of 1 16.61 1 for q = 4, using Corollary l3.51 it is 
enough to show that 

V{dB x {2r) * and \dB x {j x )\ < e^V 1 Vj x e {ji,...,jg}) = o(e). (6.8) 

For i g {!,...,£}, we write s^i for the event that xis [5,(3, ki, fce _1 )-fitand5g; for the event that \dB x {ji) \ < 
£ -i e fc/4 an( j eg/. f or even ^ 

2i = {\dB x Uc t )\>(e- 1 Vfe{l,...,i}}, 

so that 

V{dB x {2r) * and \dB x {j x )\ < P(®, n^n-n^) + P({dB x (2r) ± 0} n @|). 
Then we can split 

P(@^ n^in-n^)< P(@^ n + P(®^ nain-n^nrf/). 
By our choice of fcj in Lemma lBTTl and Corollary l3.5l we have that P[3>g n .e/f) < £w^ 5 , so that 

P(®^ nJin--n 3B e ) < eco 1 ^ 5 + P[SBg \ ®e n 5§i n • • • n n ^)P(@^_i nJin-n ^_i) . 
Thus, by Lemma lBTHl 

P(@^ nJin-n^)< eco)^ + (1 - cf)P(®/-i n^n-n ^_i) , 
By iterating this we obtain 

P(S> e nSBin---nSB e )< e£a> 1 J l 5 + Ce(l - c(/ = o(e) , 

since ^w^ 5 = o(l) and C _1 = o(/) (recall ED), and P(@i) < Ce by Corollary[33J Lastly, Lemma[63] 
shows that 

P({dJJ x (2r)j*0}n®2 = o(e), 
showing 116.81 1 and thus concluding the proof of 1 16.61 1 for g = 3 and the proof of Theorem l6.21 □ 

6.3. Proof of Theorem l6.3t Conditional second moment. We now set the stage for the proof of The- 
orem [631 We perform this by a conditional second moment argument on Sj x j ro [x,y). We will be 
conditioning on B x {j x ) = AandB y (; y ) = B where A and B are such that the event ^[x,y,j x ,j y ,ro,p, k) 
holds. We abuse notation, as before, and treat A, B as sets of vertices but our conditioning is on the 
status of all edges touching B x {j x - 1) and B y {j y - 1). Thus, while A and B are disjoint sets of vertices, 
they may be sharing closed edges. With this in mind, we generalize the notation just before Lemma 
16.61 and write P^, V B and Pa,b for the measures 

P A (-) = P[-oSA\B x {jx) = A), 

P B (0 = PO Off-BI By(jy) =E), 

P AB 0) = P(- off AuB\B x {j x ) = A, B y Uy) = B). 
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We start by proving five preparatory lemmas. 

Lemma 6.9. Assume that A,B are such that x,y are (l,/3,j x ,ro)-ftt and {l,f},jy,r )-ftt, respectively. 
Then, 

a,bedAxdB [u,u') 

Proof. Let a £ dA be a y' x , r ) -regenerative vertex. Then, by definition, 

Y j P A {a P J^^u)>a-m\B{r )\. 

u 

Denote by U the set of vertices 

U={u:P A {a^^u)^a-p m )V- 1 EiBml 
and recall that Lemma l3T5l guarantees that 

_ , P[2m„,r ],x , (l + oOS))E|B(r )| 

P A [a < ► u) < , 

by our choice of f> in I l6.lt . so that 

(l-/3)E|fl(r )| < Y,P A (a Pl2m °' ro] '> u) <\U\{\- j8 1/2 )V- l E|fl(r )| + (V- \U\)H + o(/3)) V^EIfltro)!, 

u 

and we deduce that | U\ <2/3 ll2 V. In other words, for at least (1 -2/3 ll2 )V vertices u, 

_ , P[2mo,r ],x , (l-/3 1/2 )E|SCr )| 
P A [a - - u) > . 

Similarly, for any bedB which is (/3, j y , r ) -regenerative, there exist at least (1 — 2/3 1/2 ) V vertices u such 
that 

n P[2m 0l r ],y . (1 - j8 1/2 )E|fl(r )| 

P B { b < > U )^ y • 

Thus, for such a and b, at least (1 - 4/3 ll2 )V vertices u satisfy both inequalities. Write D for this set 
of vertices so that \D C \ < A(5 ll2 V. Since the degree of each vertex is m, the number of edges having at 
least one side in D c is at most Afi ll2 Vm. Thus, at least (1 - 8/3 ll2 )Vm directed edges [u, u') are such 
that u and u' both satisfy the above inequalities. Hence 

_ , P[2mo,r ],x ^ f1 P[2m Q ,r ], y „ (1 - 8/3 1/2 ) (E|B(r ) |) 2 m 
2^ P A [a — u)P B [b < u J > . 

iu,u') V 

Since x is (1, f>, j*, r )-fit and y is (1, /3, j y , r )-fit the number of such pairs a, b is at least 

(IdAl-e-^ddBl-e- 1 ), 

and the lemma follows. □ 
Lemma 6. 10. The following bounds hold: 

£ V A>B {a P ^^ u,a^ Zl )P B {b^ Zl )P B { Zl r ^ u^^a^ldAWdBW-'miEmro)]) 2 , 

{a,b)£dA*dB 
[u,u'),zi,ti£[mo,r ] 

and 

£ P^Cfl&^J M ,a^zi)P B (fc^i zi)P B (zi ~ u') < a^iaAlia^iy^mCEISCro)!) 2 . 

(M,u'),zi,fie[TOo,;'o] 
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Proof. The proof of the second assertion is identical to the first, so we only prove the first. If a *■ [2mo ' r °J 
u and a*^* Z\, then there exists z 2 and t 2 e [m , r ] such that 



or there exists z 2 such that 



{fl — * Z 2 } o {Z 2 « ► U} o {z 2 ^— Zi} 



m P[m ,r ] P[m ,r ] 
{a <— Z 2 J ° {^2 « o {z 2 ' Z\} . 



To see this, let r\ be the lexicographically first shortest open path between a and Z\ so that I77I < r and 
let 7 be an open path between a and w such that |y| e [2m , r ]. Let z 2 be the last vertex (according to 
the ordering induced by 7) on 7 belonging to r\ (that is, the part of 7 after z 2 is disjoint from rj). Let r 2 
be the distance between a and z 2 along rj. If f 2 > mo, then the first event occurs: the first witness is 
the first t 2 open edges of 77 together with all the closed edges in the graph, the second witness is the 
set of open edges of 7 between z 2 to u (note that there are no more than r - t 2 edges since the part of 
7 between a to z 2 is of length at least t 2 ) and the third witness is the set of open edges of rj between z 2 
and u. If t 2 < m occurs, then the second event occurs by a similar reasoning. 

This leads to two different calculations, we use the BK-Reimer inequality and get that the required 
sum is at most S (fl) + S lb \ where 

S la) = £ P AB (fl — Z 2 )V A , B {Z 2 £3 U)P AB (Z2 ~ Z X )V B {b ^ Zi)P B (Z! ^ u') , 

{a,b)edAxdB,{u,u'), 
zi,Z2,h,t 2 e[m ,ro] 

and 

c(tt V" n 1 m o ,n /■ p [ m o.'"ol , ^I^o^o] , =fi r -fi , 
S W_ ^ p Afi ( a< _^ Z 2 )V AtB {Z 2 < '«)P J 4,5(Z2 < *Zi)V B (b<-^ Zi)P B Ui^ 

(a,£>)edAxdB,(M,M'), 
zi,z 2 ,fie[mo,r ] 

We use Lemma l3T5l together with Lemma l4~4l to bound the terms V A , B [a z 2 ) and V B {b *^ z\) in S ia) 
by CV~ l (1 + e) f2 and CF'a + f)' 1 , respectively. This gives 

S M <CV~ 2 £ d + e) fl+f2 £ P AB (z 2 r ^ M )P AB (z 2 ^ 2 z 1 )P AB (z 1 ^ 1 M '). 

(_a,b)edAxdB,{u,u'), z i' z 2 
fi,f 2 e[mo,ro] 

We sum over Zi, z 2 using Corollary l3.18l together with Lemma lOl to get that 

£r" 3 (l + c) 3r °" fl " 2f2 



S ( fl) < cy -2 £ (l + e) h+t2 

[a,b)edAxdB,{.u,u'), 
fi,f 2 e[m ,r ] 



C£ m + ■ 



V 



< Ca m m\dA\\dB\ V~ l e~ z {\ + e) 2r ° + C\dA\\dB\ V~ 2 me~ A {\ + ef r °r , 
where the last inequality is an immediate calculation. By Theorem l4.5l the first term is at most 

Ca m \dA\\dB\V- l mmB{rv)\) 2 , 
and by our choice of r in 112.81 1 . the second term is at most 

m * |aA||aS|V- 1 m(E|B(r )|) 2 . 

Ve 3 ? 

This gives an upper bound on S (a) fitting the error in the assertion of the lemma. To estimate S m we 
use Lemma lHTTBl to bound P A , B (z 2 ^ [m °' r °} u ) a nd V AtB {z 2 ? [m °' r °} z\). This gives 

S M <C -2 2- P A , B {a*-> z 2 )V B {b<- ►Zi)P B (zi < — *u). 

{a,b)£dAxdB,(u,u'), 
zi,Z2,helm ,r ] 
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We now sum over (u, u'), Z\,Z2 and t\ using Lemma l4~4l We get that 

S [b) < C\dA\\dB\V- 2 m{E\B{r Q )\) 2 r moe~ l {\ + E) ro < Cm °f^ C\dA\\dB\V~ l m{E\B{r Q )\) 2 , 

by Theorem l4.5l and 112.81 1, concluding our proof since ra = o(£r _1 ). □ 
Lemma 6. 1 1 . The following bounds hold: 

£ P^Ca&^J i/)Pa,b(«' ~ ^i) p s^ ~ «i)PbUi — u') < a^ 2 e|5A| 2 |dfl| V-Wl^o)!) 2 , 

(a,b)edAxdB,a'edA 
(m, u'),z\,t\E[mo,ro\ 

and 

£ PaM"^ 2 ^ u)P A , B {a! ~ Zi)P B (fc 2=S z x )P B ( Zl ^ u') < a^ 2 e|5A| 2 |dfl|V- W|fl(r )l) 2 . 

(a,b)edAxdB,a'edA 
(m, u'),z\,t\E[mo,ro\ 

Proof. The proof of the second assertion is identical to the first, so we only prove the first. We use 
Lemma EES to bound P A<B {a ^ l2mo ' r "> u) < CV- l E\B(r )\ and P B (b ~ z x ) < CV^U + e)' 1 . We then 
sum P B {zi ^— -» m') over (w, i/) to obtain a factor of Cm£ _1 (l + e) r °~ fl by Lemma I4~4l We now sum 
Pab(«' < -^ > Z\) over zi and get another E|B(r )| factor and now sum all this over a, b, a', t\. This gives a 
contribution of 

C\dA\ 2 \dB\V- 2 mr (E\B(r )\) 3 , 

by Theorem l4.5[ This is at most 

Ce\dA\ali 2 loe(e 3 V) „ „ , , 
Ve^V 7 

by our choice of r in 1 12.81 1 and Lemma lOl concluding our proof. □ 
Lemma 6.12. The following bounds hold: 

£ P A (a ~ ZiJPaUi — u)P B (fc£^^! u') < C|<9A||aB||B|V- 2 mro(E|B(r )|) 2 , 

a.fcedAxdB 
(M,M , ),zieB,fie[mo.''o] 

and 

£ P A (a 2=S zi)P A (zi ^ u)P B (k^^ u') < C|aA||aB||5|y- 2 mro(E|B(r )|) 2 . 

a.ftedAxdB 
(u,M'),zieB,fie[m ,/"o] 

Proof. The proof of the second assertion is identical to the first so we only prove the first. We use 
Lemma l3.13l to bound 

p B{b PJ2rn^ ul) < (1 + o{1))V -l Emro)l 

and, as before, we use Lemma l3. Al together with Lemma l4~4l to bound 

P A (a~ Zl )<CV- l a + s) h , 
sum over u' such that (u, u') e E{G), and finally use Lemma l4~4l to bound 

^P A (z 1 ^ l /)<Ce- 1 (l + £)'' - fl . 

u 

Altogether, after summing over a e dA, b e d-B, zi g B,t\< r , this gives the bound of 

C\dA\\dB\\B\V~ 2 mr e~ l {l + e) ro E\B{r )\ < C\dA\\dB\\B\V~ 2 mr (E\B(r )\) 2 , 
where we have used Theorem l4.51 □ 
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Lemma 6.13. For any positive 6>0 and /3 > 0, 

where 

En<C\dA\\dB\V~ l m(E\B(r )\) 2 



r (U4| + |£|)V _1 + < 2 (l + e|dA|) 



Proof. We have that 

E AiB Sj xi j yt r {x,y) = E E v a,b{{ci ?l l2m °' ro] '* u} and {b f [2wo ' rol f u > ff B x {j x + r )}) , (6.9) 

{a,b)£dAxdB {u,u') 

because the additional requirement that a and b are pivotals in the definitions of {a ^ [2OTo,r ° ], f M j an( j 
{b *■ [2m ° ,rol '-r M 'j implies that they are unique in dA x dB, so no pair (a, b) is overcounted in the sum. 
We define Bg A {r ; AuB) = u a f e g A B a >{r ; Au B). We condition on Bg A {r ; A u B) = H for an admissible 

H (that is, ai 
116.911 equals 



H (that is, any H that has positive probability and a * [2mo ''~ ol, f M ff g occurs in it). Each summand in 



E p AB (5aA(r ; AuB) = H)V AtB [b P , [2m ° M ' y , u ' off H \ B dA (r ; A u B) = h) , 



P[2m Q ,r Q ],y 

V A ,B(BdAW,/lU£}) =tl)V AiB \B 

H 

and we have 

( P[2m ,r ],y , CCTT . n , . TT \ n fl P[2m ,r ],y , cc . 

Pa,b 1^" « off if I 5^ (r ; AuB) = H = P B [b< — u off AuH), 

because in both sides the status of the edges touching Au H cannot change the occurrence of the 
event. This gives that 

^A,BSj x ,j y ,r {x,y)= £ Y L p AMBdA(r ) AuB) = mP B (b^^u' off AuH). 

(a,b)edAxdB{u,u') H 

Now, by ClaimEU 

P[2m ,r ],y , P[2m ,r ],y , P[2m ,;' ] , , . , , „. 

Pb(o < >-w off AuH) > Pb(o < ►m)-P_b(oi ► u only on Au H) , 

where in the last term we have dropped the requirement that y is pivotal (which only increases the 
probability). Hence by summing on H we get, 

rr c r w V" V" n r P[2mo,r ],X P[2m ,r ],y . 
E A,BSj x ,j y , ro {x,y)> £ L p A,Bia< -M)P B Cb«- -M)-S 2 , 

{a,b)£dAxdB (u,u r ) 

where 

S 2= £ E E P AB(5 M (r ;Au5) = H)P B (fe^ 2 ^u'onlyonAuH). (6.10) 

(a,b)£dAxdB(u,u') H 

As before, 

P[2m ,r Q ],x P[2m ,r ],x 

Pa,b(^ < * w) = Pa(^ < * woffB), 

since the status of the edges touching A u B in both sides does not matter. Claim l3Al again gives that 

_ , P[2m ,r ],x 

V A {a *- ► u c 

and so we may further expand 



P[2m ,r Q ],x P[2m ,r ],x P[2m Q ,r ] 

P A (a*- >uoffB) > V A {a^~ -u)-P A (a*- w only on £) , 



^A,BSj x ,j y ,r (x, y)>Si-S 2 -S 3 , 

with 

c „ , P[2m ,r ],x ,, P[2m a ,r a ],y u 

Si= £ £ p a(« < *u)V B {b< ►!<), 
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s 3= L L p ^( a< - uonlyonB)P B (fo^ >u), 

{a,b)£dAxdB [u,u') 

and S2 is denned in 16.101 . Lemma l(x9l gives the required lower bound on Si which yields the positive 
contribution in the assertion of this lemma. We now bound S2 and S3 from above, starting with S3. If 

a ^ [2mo, '°} u only on B, then either a 2 ^ u or there exists Z\ g B and t\ g [m , r ] such that 

{u * — ► Z\} o {z\ *— — ► u] or {a< — ► z\}o{z\ <— - u}, 

Indeed, let 7 be the lexicographically first shortest path between a and u. If |y| < 2m , then a ^2 u , 
otherwise |y| g [2 mo, To] and we take Z\ to be the first vertex in B visited by 7 and ? is such that 7(f) = z\. 
If t > m , then we put f] = ? and otherwise we put t\ = I7I - t. In any case ti g [m , r ]. When f > m , 

the witness for a z\ is the set of open edges of the path 7(0, t] together with all the closed edges of 

the graph and the witness for z\ ^— -i a are the open edges of y[t, |y|]. The case t< mo is done similarly. 



We get that 



S3 < 2^ P^fa— * M)Pfl(fc«- *U) 



{a,b)£dA*dB 
(u,u'} 



+ 2^ P^(fl zJPaUi ^ — ►U)P B (Z?^ *U) 

[a,b)£dAxdB 
{u,u'),zi£B,ti£[mo,ro] 

+ 2^ Pa(0* " Zi)P A (Zl <— U)P B (&< ►!/). 

(«,M'),zieB,fie[mo,ro] 

For the first term we bound Pg (£> ^ [2mo ' r °> z/) < CV _1 E|i?(ro)| by Lemma l3 . 1 3 1 and sum over everything 
to get a contribution bounded by 

C\dA\\dB\mV- l <E\B{r Q )\m Q < C\dA\\dB\V- l m{miro)\) 2 [momB{r Q )\Y 1 } 

< Ca^iaAliasiy-^CEISCro)!) 2 , (6.11) 

by our choice of r in 1 12.81 1 . our assumptions a m > (e 3 l/)" 1/2 in 1 12.51 1 and m = o(e _1 ) and Corollary 
14.61 This fits in the second term of Err in the assertion of the lemma. We bound the second and third 
terms using Lemma lB. 121 giving an upper bound of 

C\dA\\dB\\B\V- 2 mr (E\B(r )\f , 

which fits in the first term of Err in the assertion of the lemma. 

We proceed to bound S2 in H6.1G1 from above. As before, if b *■ [2m °' r °} u ' only on Hu A, then either 
b ^ u' or there exists Z\£ Hi) A and t\ g [ra , r ] such that 

{b^ Zl }o{ Zl r ^u'} or {b r £^z x }o{ Zx ^u'}. (6.12) 

The case b u' is handled as before and gives a contribution of C\dA\\dB\mV~ 1 E\B(r )\m which 
by 116.1111 again fits the second term of Err. To handle the other cases, let us first sum the contribution 
to S2 due to (16. 12D over Z\ g H. We use the BK-Reimer inequality and change order of summation to 
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bound this contribution to S 2 by 

£ PA,B(a ^ [2mo ' r °' u,3a! g dA such that a' zi)P B {b *^ zi)Pb(zi 2— ^ u') 

{a,b)£dA*dB 
(u,u'),zi,tiE[ma,ro\ 

+ £ Pa,b {.a P[2m °' r °} u,3a' e dA such that a' zi)P B (b ^ z\)Pb {zi u') . 

{a,b)edAxdB 
[u,u'),zi,ti<ro 

Now, if a p[2m °'' } u an( j there exists a! e dA with a' *^* Z\, then either a p[2mo,r °} u> a JX z\ or there 
exists a' £ dA such that {a *■ [2m °' r °) u } {a' zi}. Hence we may bound this from above by (I) + [II) 
where 

(/) = 2^ Pa,b(« < ► w,a<— zi)P B (fr<— zi)Pb(zi < — 



[a,b)£dAxdB 
(u,u'),zi,ti£[mo,ra] 



+ L p AB( a< ► u,a*-^ zi)V B ib< — ►z i )P B (zi «— u), 

[a,b)£dAxdB 
[u,u'),zi,ti<ro 

[ID = E P A Ma P ^^ u)P A Ma' ^ z^Psib^ Zl )P B ( Zl r ^ u') 

{a,b)edAxdB,a'edA 
(u,u'),z\,t\E[mQ,ro\ 

+ E Pa,b(« < > u)P a ,b{ci zi)P B {b< — ► zi)P B (zi <— u ). 

(a,b)edAxdB,a'edA 
(M,u'),zi,fi<ro 

Lemma [6.101 readily gives that (I) < a^ 2 |3yl||35| y _1 m(E|6(ro)|) 2 which fits into the second term of 
Err. Lemma[6in] gives that (17) < a]^£\dA\ 2 \dB\V~ l m{E\B{r Q )\) 2 which fits in the second term of Err. 
We sum the contribution to S 2 due to H6.12I I over Z\ e A and bound it from above by 

£ P A b(«^^ u)P b &~ Zi)P b (zi ~ u') < C|aA||aB||yl|y- 2 mro(E|B(ro)|) 2 , 

(a,b)edAxd£ 
(w, w'),zieA,tie[mo,ro] 

by an appeal to Lemma lB.121 This fits in the first term of Err and concludes our proof. □ 



and 



Ea,b S jx> j ro (x, yfl 2 ,- < Qi + Q 2 + Q 3 , 

:d\y-dlV, 



Lemma 6.14. The following bound holds: 

,jy> 

where 

Qi = [l + O(a m + em ))V- 2 m 2 {E\B(r )\) 4 \dA\ 2 \dB\ 2 , 

Q 2 = CV- 2 m 2 e- l (E\B(r )\) 4 \dA\\dB\(\dA\ + \dB\), 

Q 3 = CV- 2 m 2 e- 2 (E\B(r )\) 4 \dA\\dB\. 

Proof. Assume that [u\, u[) and [u 2 , u' 2 ) are two edges and let a, a\,a 2 be vertices in dA and b, b\,b 2 
vertices in dB. Define 

._, , . Pl2m ,r ] P[2m ,r ] P[2m ,r ] , . P[2m ,r ] . 

zr{u\,u 2 ,ai,a 2 ) = {ai < ► ui}o{a 2 < ► u 2 }, 2T{u\,u 2 ,a) - {a< ► ui}n{a< ► u 2 }. 

(6.13) 

We define ST{u\, u 2 , b\, b 2 ) and 3~{u' v u' 2 , b) in a similar fashion. 

Now, if (mi, u\ ) and {u 2 , u' 2 ) are counted in Si j v r ix, y) 2 l 2ro , then one of the following events 

{dA-^dB} 

must occur off A u B: 
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(1) There exists a\, a 2 ,b\,b 2 such that 3~{u\, u 2 , a\,a 2 ) °3~(u' v u' 2 ,bi,b 2 ) occurs, 

(2) There exists a\,a 2 ,b such that 2T{u\, u 2 , a\, a 2 ) o 3~{u[, u' 2 , b) occurs, or the symmetric case 
3~{u\, u 2 , a) o 3~ {u\, u' 2 ,bi,b 2 ). 

(3) There exists a,bsuchfaat^(ui,u 2 ,a)o&~(u' v u 2 ,b) occurs. 

See Figure [5] Observe that the disjoint occurrence of the events is implied since dA dB. We now 
sum the probability of these events over {u\, u'J, [u 2 , u' 2 ) and this gives us three terms which we will 
bound by Qi, Q 2 and Q 3 , respectively. By Lemma lHTTBI and the BK inequality, 

„ (ari {l + O(a m + em ))(E\B(r )\) 2 

P AB {T{u 1 ,u 2 ,a 1 ,a 2 ))<- ■^ 2 — L , (6.14) 

whence 

X V AtB {3-{u x , u 2 , a x> a 2 )o3-(u' v u' 2 , bM) < (l + 0{a m + em )) V- 2 m 2 (E\B(r )\f\dA\ 2 \dB\ 2 , 

ai,ci2,bi,b2, 
{Ui,u' 1 ),[U2,uL) 

which equals Q\. To bound the probability of (2), if 3~{u[, u' 2 , b) occurs, then, as before, there exists a 
vertex z\ and t\ e [mo, r ] such that 

{b «— • Zi) o {Zi « ► U x } o {Zi ^— U 2 \ , 



or there exists z\ such that 
Hence, the BK-Reimer inequality gives that 



n rn . P[m ,r ] P[m ,r ] , 
{O <— * Z\] o {Zi ► U x \ o {Z\ <- — U 2 } 



£ P A , B {3-{u[,u' 2 , b)) < £ Pa,b(&~ si)PabGzi — «i)P^,flUi — 4) 



tie[mo,r ] 



+ L p a,b(&~.zi)PabOzi < "iJPa.bUi'' -w 2 )- 

We estimate the first sum by summing on u' 2 , u\ then on z\, h using Lemma l4~4l to get a bound of 

C£" 3 (l + £) 2ro ^Ce-^EIBfro)!) 2 , 
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by Theorem l4.51 and the second sum is bounded by Cmo(E|B(r )|) 2 which of lower order since emo- 
o(l) by H2.9I I. We use the BK inequality and D6.14I I to bound the contribution due to the first event in 
(2) by 

X V AtB {3-{u 1 ,U2,a l ,a 2 ))V AtB {3-{u[,u' 2 M)<CV- 2 m 2 E-\m{r Q )\) i \dA\ 2 \dB\. 

a\,a2,b,(u\,ii^), 
{U2,u' 2 ) 

The symmetric in (2) obeys the same bound with the roles of |<9^4| and \dB \ reversed. This contribution 
equals Q 2 . To bound the contribution due to (3), we note that 

£ P AiB (3~(ui, u 2 , a))P AiB {3~ \u' v u 2 , b)) , 

a,b,(u\,u' l ),(U2,u' 2 ) 

is bounded using the BK-Reimer inequality by the three sums 

£ P AtB {a^ zi)P a ,b(zi i ui)P A>B (zi r ^—^ u 2 )P A , B {b^ z 2 )P AyB {z 2 ^— * u[)P AtB {z 2 2— 2 u' 2 ), 

a,b,[ui,u[),{u2,u' 2 ) 
zi,Z2,ti,t 2 e[m ,r ] 

Zt. r =f i mi r r o-fi , rl , T a -h , n n m Q P[m ,r ] , P[m ,r ] , 
P A , B {a^ zi)P A)B (zi — ui)V A}B {zi — i/ 2 )Pa„b (*>«-• 2 2 )Pa,b(2 2 - — — u^P^gfe « ► u 2 ), 

a,b,(ui,u' l ),{u2,u' 2 } 
zi,Z2,h£[mo,r ] 

and 

Z n , m P[m Q ,r ] P[m ,r Q ] m P[m a ,r ) , P[m ,r Q ] , 
V A>B {a*-^ zi)V AB {zi < ► ui)V A>B {zi < ► u 2 )Y A>B {b*-^ z 2 )V A>B {z 2 < ► u x )V A>B {z 2 < ► u 2 ) . 

{U2,U 2 ),Zi,Z 2 

To bound the first sum, we use Lemma l3. 12l and Lemma l4.4l to bound P a r f a <— > Zi) < CV _1 U+£) fl and 

Pa,b(^ z 2 ) < CV _1 (l+e) tz . We then use Lemma l3. 19l and Lemmal4~4lto sum over Z\,z 2 , {U\, u[), (u 2 , u' 2 ). 
This gives us an upper bound of 

CV~ 2 \dA\\dB\m 2 e~ 6 {l + ef ro + C\dA\\dB\V~ 1 m 2 m a m £ (l + c) fl + f2 

ti,t 2 e[m ,r ] 

< CV- 2 \dA\\dB\m 2 e~ 2 (E\B(r )\f , 

where the last inequality is due to Theorem [43] and our choice of r in ( 12.8ft . This is contained in Q 3 . 
To bound the second sum, we use Lemma I3TT51 to bound each of the last two terms by CV~ 1 E\B{r )\. 
We then sum over {u\, u\) and {u 2 , u' 2 ) using Lemma l4~4l We then sum over Z\,z 2 using Lemma l4~4l 
and finally over a, b, t\ to get that this sum is at most 

C\dA\\dB\V~ 2 m 2 (E\B{r )\fe~ 1 m , 

which is contained in Q 3 since em = o(l). For the third sum we use Lemma l3.15l four times, and then 
sum over everything to get a bound of 

C\dA\\dB\V- 2 m 2 (E\B(r )\fm 2 , 

which is also contained in Q 3 , concluding our proof. □ 

Proof of Theorem l6.3l Instead of conditioning on J{x) = j x , J{y) = j y we condition on B x {j x ) - A and 
By(jy) = B such that the event sd (x, y, j x , j y , r , j8, A;) holds. This is a stronger conditioning and implies 
the assertion of the theorem. 

By requirement (2) of sd {x, y, j x ,j y , r , ft, k) and our choice of parameters 

roQAl + lBDV' 1 < I/ _1 c- 3 log(£ 3 V)e 3M < (log^V) -1 , 
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and 

Hence the error term in Lemma l6.13l is at most 

Err< C[{\o%e 2 Vy l + a)^\\dA\\dB\V- l mmB{r Q )\) 2 . 

Lemma l6.13l together with requirement (5) in the definition of si {x, y, j x , j y , ro, p, k) and our choice of 
P in ([67D (in particular, that p « a 1 ^ 4 a (loge 3 ^)" 1 by (277)) give 

E AiB \s jxrjY , ro (x,yn 2ro ]>a-Cp m )V- 1 miE\B{r )\) 2 \dA\\dB\. 

L " {dA^dB} i 

Since \dA\ and \dB\ are at least e kl4 e~ 1 , 

e~ l \dA\ 2 \dB\ + e~ l \dA\ 2 \dB\ + e~ 2 \dA\\dB\ < Ce~ kl4 \dA\ 2 \dB\ 2 , 
hence, by Lemma l6.14l and our choice of parameters, 



Sh,j Y ,r (x,yfl 2r )<{\ + 0{e- hl4 ))V- 2 m 2 mB{roW\dA\ 2 \dBV 

{3A/*dB} 



Ea,b 

We conclude that 

VAji{S] x ,j y ,r ix,y) >2p ll2 V- l mmB{r Q )\) 2 \dA\\dB\) > l-0{p V2 ), 
where we used the fact that e~ kl4 = o{P) and 14.41 1 . This concludes our proof. □ 

7. Proofs of main theorems 

7.1. Proof of Theorem 1 1.31 In Section IZ41 we already proved Theorem ll.3l fa) hence we may assume 
that the finite triangle condition 11.51 holds and focus on part (b) of the theorem. Since l^il < fco + Z>jt Q 
where ko is from Theorem l2.2[ Lemma l231 immediately gives that l^il < (2 + o{l))eV whp, showing the 
required upper bound on | — note that this argument only uses the finite triangle condition hence 
it is valid for any e m satisfying e m » V~ 113 and e m = o(l). For the lower bound we will additionally 
assume, as part (b) requires, that e m = oim^ 1 ) and show that 

Pp(|^il > (2 - o(l))eV) = 1 - o(l) . (7.1) 

This establishes part (b) of Theorem 1 1.31 Recall that p = p c {\ + e) is our percolation probability, let 
9 > be an arbitrary small constant and put p 2 , p\ to satisfy 

p 2 = 6e/m, p c (l + e) = pi + (l-pi)p 2 , 

so that p c {l + (1 - 6)e) < p\ < p c (X + £) since p c > 1/ra. Denote by G pi and G P2 two independent 
percolation instances of G with parameters p\ and p 2 , respectively. The sprinkling procedure relies 
on the fact that G p is distributed as G Pl u G P2 . We first apply Theorem l2.4l to G Pl and deduce that for 
M, r defined in 112 71 and ro defined in 12.811 . 

P P1 [P nro > (1 - 36He 2 V 2 ) > 1 - o(l) . (7.2) 

Now we wish to show that when we "sprinkle" this configuration in G Pl , that is, when we add to the 
configuration independent p2-open edges, most of these vertices join together to form one cluster 
of size roughly 2eV. To make this formal, given G Pl , we construct an auxiliary simple graph H with 
vertex set 

V(H) = {xeG Pl : \^{x)\ > (rV) i; V 2 } , 

and edge set 

E(H) = {(x, y) e V{H) 2 : x, y are (r, r )-good} . 
Thus, using Lemma lZ3l with k = e~ 2 {e 3 V) 114 and 17.21 1, with probability at least 1 - o(l), 

\V{H)\ = {2 + o{l))eV, \E(H)\ > {l-36)4e 2 V 2 . (7.3) 
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Denote v = | V(H)\ and write Xi,...,x v for the vertices in G pi corresponding to those of H. Given 
G Pl for which the event in 17.3t occurs, we will show that whp in G Pl u G P2 there is no way to partition 
the set of vertices into Mi\±)M 2 = {x\,...,x v } with \M\ \ > 3d v and |M 2 | > 36 v such that there is no open 
path in G Pl u G P2 connecting a vertex in Mi with a vertex in M 2 . This implies that whp the largest 
connected component in G Pl u G P2 is of size at least (1 - 36) v. 

To show this, we first claim that the number of such partitions is at most 2 3 ^ e3v)3 ' 4 since |^(X;)| > 
(£ 3 V) ll4 e~ 2 . Secondly, given such a partition, we claim that the number of edges (u, u') such that u e 
Mi and u' e M 2 (note that, by definition, these edges must be pi-closed) is at least e~ 40M [\ogM]~ 1 6e 2 Vm. 
To see this, we consider the set of edges in H for which both sides lie in either Mi or M 2 (more pre- 
cisely, the vertices of H corresponding to Mi and M 2 ). This number is at most 

M\ + M|< (36v) 2 + (l-3d) 2 v 2 < (1-56) v 2 , 

where we used the fact that 6 > is a small enough constant, Mi + M 2 = v and both Mi and M 2 are in 
[36 v, (1 - 3d) v]. By 1 17.31 1, the number of edges in H such that one end is in Mi and the other in M 2 is 
at least 6e 2 V 2 . In other words, there are at least 6e 2 V 2 pairs {x,y) £ Mi x M 2 such that S 2 ,- +ro (x,y) > 
(logM~ 1 ^~ 1 ^£~ 2 (E|6(r )|) 2 . Note that is a large number due to our condition 12.811 . In total, we 
counted SLt\eastde 2 V 2 ■ [logM)~ 1 V~ 1 me~ 2 {E\B{ro)\) 2 edges {u, u') and no edge is counted more than 
\B u (2r + r )| • |5 u /(2r + r )| times, which is at most e 40M fr" 2 (E|B(r )|) 2 by the definition of S 2r+ro (x, y) 
and the second claim follows. 

Hence, if l^il < (1 - 3d) v, then there exists such a partition in which all of the above edges (u, u') 
are p 2 -closed. By the two claims above, the probability of this is at most 

2 3(e 3 V) 3/4 ( 1 _ ^e- 40M (logM)' 1 eme 2 V < 2 3(e 3 V) 3/4 e -e- 4OJW (logM)- 1 2 £ 3 V _ Q ^ 

since p 2 = 6 elm and by our choice of parameters in 12.71 1 and 12.91 . This concludes the proof of 17.lt 
since 6 > was arbitrary and establishes the required estimate on 1^1 of Theorem ll.3l (b). 

We now use 17.lt to show the required bounds on E|^(0)| and 1^1. The upper bound E|^(0)| < 
(4 + o{l))e 2 V is stated in Lemma [231 and the lower bound follows immediately from our estimate on 
^i. Indeed, write for the y'th largest component. Then 

E|^(0)| = V" 1 £ E|<^(f/)| = y" 1 ^E|^| 2 > V _1 E|^i| 2 > (4-o(l))e 2 V, 

veVlG) j>\ 

where the first equality is by transitivity, the second equality is because each component ^ is counted 
I'tfjl times in the sum on the left and the last inequality is due to 17.lt . Furthermore, by this inequality 
and Lemma l231 we deduce that 

^E|^| 2 = o(£ 2 ^ 2 ), 

and hence |^ 2 | = o{eV) whp. This concludes the proof of Theorem ll.31 □ 

7.2. Proof ofTheorem ll.il In this section we restrict our attention to the hypercube and prove The- 
orem [TTTJ We begin by showing that m , defined in Theorem 11.31 with a m = m _1 logm, satisfies 
mo = O(mlogm). See Lemma [77T1 The proof of Theorem [TTTJ is then split into two cases. In the first 
case we assume that e m < 1/ra 2 so that e = o(m " 1 ) and appeal to Theorem ll.31 In the second case we 
perform the classical sprinkling argument for the case e > 1/m 2 , as done in fT71 . 

Lemma 7.1 (NBW estimates). On the hypercube {0, \} m 

T m ix(m~ l log m) = O(mlogm), 
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and for any integer L> 1 

L 

su pE E p tl {x,u)p t2 {u,v)p t3 {v,y)<0(l/m 2 ) + 0{L 3 IV). (7.4) 

x >y u,v fi,«2,f3=o 

fl + f 2 + f3>3 

Proof. We make use of the results in |24|, as we explain now. The bound on T m i x [m~ 1 \ogm) = 0(m log m) 
is EU Theorem 3.5]. We next explain how to prove 117.41 1, which will give condition (3) in Theorem ll.3l 
for L = Am log m and an appropriate A > 0. 

Let D: {0, \} m — - [0, 1] be the simple random walk transition probability on the hypercube. Our 
proof of 117.41 1 relies on Fourier theory. For convenience, we take the Fourier dual of {0, \} m to be {0, \} m . 
Then, the Fourier transform f(k) of /: {0, \} m — ► R is given by 

f(k)= Y (-D X 'VW- (7-5) 

xe{0,l} m 

with inverse Fourier transform 

/« = 77 E (-D X '7(A:). (7.6) 
^ fce{0,H m 

For the hypercube, D[k) takes the appealingly simple form 

b{k) = \-2a{k)lm, (7.7) 

where a(fc) is the number of non-zero coordinates of k. 

In EU Theorem 3.5] it is proved that, when m > 2 and f > 1, with p f (fc) denoting the Fourier trans- 
form of x—» p f (0,x), 

|p f (/c)|<max(|D(A;)|,l/v / m-l) f " 1 , (7.8) 

and p°(fc) = 1. This gives us all the necessary bounds to prove the NBW triangle condition H7.4L 

Denote the sum in H7.4H by S. The contribution to S where t\ + t2 + h = 3 equals 0(1/ m 2 ). Thus, we 
are left to bound the contribution due to ti,t 2 , h with 1 1 + t 2 + t 3 > 4. For any t > 1, 

TYl 

p\x,y)< -(D*p ? - 1 )(x,y), (7.9) 

m- 1 

where, for /, g : {0, 1} ra — ► U, we define the convolution / * g by 

(/*g)W= E /(y)g(*-y)- (7-io) 

ye{0,l} m 

Therefore, 

S < C/m 2 + 3 4 (— ^-) 4 sup Y, (D* 4 *p Sl *p S2 *p S3 )(x,y), (7.11) 

Si + S2+ S 3— 

where 3 4 is an upper bound on the number of ways we can add 4 to the coordinates of [si,S2, S3) to get 
(fi, h< £3) with t\ + ?2 + t + 3 > 4. The above can be bounded in terms of Fourier transforms as 

C L 
S< C/m 2 + - sup £ (-i)^(y-*) ^ D{k) 4 p Sl (k)p S2 (k)p S3 (k) (7.12) 

^ x >y fce{0,l} m si,s 2 ,s 3 =0 
C L 

<C/m 2 + - Y E ^(A:) 4 |p 5l (fc)||p S2 (fc)||p^(fc)|. 

17 fce{0,l} m si,s 2 ,s 3 =0 

S\+S2 + S 3 >0 

The contribution to k = equals L 3 / V since D (0) = p f (0) = 1 , and the contribution due to k = 1 (where 
1 denotes the all 1 vector) obeys the same bound. It is not hard to adapt the proof of [El Proposition 
1.2] to show that the sum over k ^ 0, 1 is 0(1/ m 2 ). We perform the details of this computation now. 
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Writing x + = max(x, 0) for xeR, and noting that there are at most 2 values of 5 for which [s— 1)+ = t, 
we obtain 

S<C/m 2 + 2L 3 /V+- X E 6(A:) 4 max(|D(A;)|,l/v^rn:) (Sl " 1)++(S2 " 1)++(S3 " 1)+ 

" te{0,l} m : fc#0,l si,s 2 ,s 3 =0 

/^p3 oo 

<C/m 2 + 2L 3 /V+— £ b ^ E max(|D(A:)|,l/v^rn:) Sl+w 

^ fce{0,l} m : fc^O *i,S2.S3=0 

, o C2 3 ^ D(A:) 4 
= C/m 2 + 2L 3 /V + £ ^— (7.13) 



We bound 



^ fce{o,i} m : /t^o.i [l-max(|D(fc)|,l/\/m- l)]' 
1 ^ D(A;) 4 

- E 3 (7.14) 

v te{o,i} m : fc^o.i [l-max(|D(A:)|,l/\/m- 1)] 



1 

te{0,l} m : fe#0,l 



< - y Dm 1 



1 

+ ■ 



I y £>(fc) 4 =2 _ w y 

V ke { o,i}kk^,i[l-\D(k)\] 3 



[l-|D(fc)|] 3 [l-l/Vm^T] 3 

We next use the fact that h L/te{o,i! m -D(^) 4 is the probability that a four-step simple random walk on 
the hypercube returns to its starting point, which is 0(1 1 m 2 ). Alternatively, and more useful for the 
proof that follows, we can write 

1 m ( m\ 

— £ D(/c) 4 = 2" m £ . (l-2;7m) 4 = m~ 4 E[(2X-m) 4 ] = 0(l/m 2 ), (7.15) 

^ fce{0,l} m j=0 \J j 

where X has a binomial distribution with parameters 1/2 and m, and we use that E[(2X - m) 4 ] = 
0(m 2 ). We use similar ideas to deal with the contribution involving [1 - |D(fc)|]~ 3 , which we rewrite as 

'm\ (1-2 /7 m) 4 

q . (7.16) 

j j[(2;7m)A(2-2;7m)] 3 

The sum 2 m X yg'[m/4,3m/4] i™) ls exponentially small in m by either Stirling's formula or large deviation 
bounds on the binomial distribution with parameters m and 1/2. When j e [ra/4,3m/4], we can 
bound l/[(2 jlm) A (2-2y7m)] 3 < 8 to bound the above sum by 0(1 lm 2 ) in the same way as in ( 17.151 . 
Together with H7.13I I , this completes the proof of H7.4H . □ 

Proof of Theorem 1 1.11 We start by proving the theorem in the case e m < 1/ra 2 . We take a m = 
m _1 logm and Lemma P7TT1 shows that mo = O(mlogm) and that condition (3) of Theorem [L3] holds. 
Condition (2) of Theorem [L3] holds by 11.21 . Condition (1) is fulfilled automatically, therefore, in this 
case Theorem ll.ll fo flows from Theorem ll.31 

We now handle the case e > 1/m 2 and e = o(l). We start by proving I17.1I I in this case. In U71 . it 
is proven that |^i| > ceV whp in this case, and the argument used there is based on isoperimetry 
together with Lemma l231 and suffices to prove the required 2e V estimate in our setting as well, as we 
show now. 

Let 6 > be a small arbitrary constant. As before, fix the sprinkling probability p2-de/m and take 
pi such that p = p c {\ +e) = p\ + (1 - pi)p2 so that p\ = p c {\ + [1-6 + o(l))e). By Lemma l2.31 whp in 
G Pl , 

2(l-29)eV<Z^ <2(l + 6)eV, 

for k Q = e~ 2 {e 3 V) 114 . As a result, there are at most 2(1 + 6)eV/k Q = 2(1 + 6){e 3 V) 3IA clusters of size at 
least k . Denote these clusters by (@ ; ) i£ /, so that |7| < 2(1 + 6)(e 3 V) 314 . 

As before, we now perform sprinkling and add the edges of G P2 . We bound the probability that 
after the sprinkling there is a partition of the clusters [3>i)iei into two sets S, T both containing at least 
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6eV vertices such that there is no path in G P2 connecting them. If there is no such partition, then the 
largest component in G Pl u G P2 has size at least (2 - 39) eV and we conclude the proof. We follow [T7J 
Proof of Proposition 2.5] . 

Since \I\ < 2(1 + 6){e 3 V) 314 the number of such partitions is at most 2 2(1+0)(e3y)3M . We bound the 
probability that given such a partition there is no /?2-open path connecting them. By [T7J Lemma 2.4], 
whenever A > 1 satisfies 

e- A2l2m <6e/2, (7.17) 

there is a collection of at least \Qem~ m V edge disjoint paths connecting S and T, each of length at 
most A. This is where the isoperimetric inequality on the hypercube is being used. Note that A needs 
to be large, in fact, we put A = m 213 and use the fact that e > m~ 2 so that J7.17II holds. The probability 
that a path of length A has a /?2-closed edge in it is 1 - p% ■ Since the paths are disjoint, these events are 
independent, and we learn that the probability that they all have a /?2-closed edge in them is at most 

(l - p*f Em ~ 2&V < e -cp^em-^V = e -c8^m^V_ ^ 
Thus, the total probability that sprinkling fails is at most 

2 2a+8)(£ 3 V) 1 - a e -c0 A e A m- 3A y _ ^a^-oWm ^ ^ 

since a > and e > m~ 2 (in fact, this argument works as long as e > e~ cm13 ). The proof of H7.1I I when 
£ » V~ 113 and £ = o(l) is now completed. 

The remaining bounds on |^i|, £1^(0)1 and |^ 2 | only rely on 1 17.11 1 and Lemma l2T3"l and are performed 
exactly as in the conclusion of the proof of Theorem ll.31 This completes the proof of Theorem ll.il □ 

7.3. Proof of Theorem ll.4i Our expansion and girth assumption of the theorem allows us to deduce 
some crude yet sufficient bounds on p f (•, •), namely, that there exists some constant q > so that 

t,n™ \ V ~ q t<C\ogV, , ti ^ |(m-l)- f f<(clog m _ 1 y)/2, 
p f (0,0)<< , and p f (x,y)<^ 

\CV~ l t>C\ogV, v^>*> W-t? f> (clog m _ 1 F)/2. 



Indeed, the second bound on p f (0,0) comes from the classical fact that r m ix(C^ _1 ) = O(logF). See 
e.g. |5, below (19)]. The first bound on p f (0,0) comes from the girth assumption, indeed, the BFS tree 
of G rooted at is a tree up to height Vgl2\ , where g is the girth of the graph. Hence, in order for the 
walker to return to at time t it must be at distance t - [gl2\ from and then take the unique path of 
length [gl2\ to so that q can be taken to be any number smaller than c/2. The bounds on p f [x, y) 
are proved similarly. 

We take a m = CQogl/) -1 (which is at least 1/m by our assumption that m > clogV) and prove 
that conditions (2) and (3) of Theorem [L3] hold. Note that mo = O(logl^). To show condition (2) we 
show that percolation with p={m- 1) _1 (1 + a m /\ogV) has E p 1^(0)1 » V 113 , whence p c ^P and thus 
condition (2) holds. To show this lower bound on E p 1^(0)1 in this regime of p it is possible to use a 
classical sprinkling argument. However, it is quicker to use [44, Theorem 4] and verify that 

£~ l r £ [(1 + £) fAr - l]p f (0,0) = 0(1) , (7.20) 

f=l 

where £ = a m l\ogV and r = £ -1 [log(£ 3 l0 -3loglog(£ 3 ^)]. Theorem 4 of O then yields that PfJ^il > 
£>£W(log(£ 3 l0) 3 ) = 1 - o(l) for some b > which immediately gives a lower bound on E|^(0)| since 

E|if (0)i > v^toi 2 > (i + o(i))^ 2 £ 2 y/(iog(£ 3 y)) 6 » v 113 , 
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by our choice of e. We use our bounds on p f (0, 0) above and sum 117.201 1 separately for t < Clog V and 
t > ClogV. For t< Clog V we bound (1 + c) f - 1 = 0{et) and use our first bound p f (0,0) < V~ q to get 

ClogV ClogV 

e~ l r Y, [d + e) fAr -l]p f (0,0)<r £ tV~ q = o{\). 
r=i f=i 

When t > Clog V we bound 

(1 + £) fAr - 1 < (1 + e) r = e 3 V{\oge 3 V)~ 3 = 0(c 3 V(log V)~ 3 ) , 
by our choice of e. We use our second bound p f (0,0) < CV~ l to bound 

e~ l r £ [(l + e) fAr -l]p f (0,0) = O(r 2 £ 2 (logV)- 3 ) = o(l), 

f=ClogV 

since r < C(log V) 2 . This concludes the verification of condition (2) of Theorem ll.31 
To verify condition (3) we need to prove the bound 

ClogV 

£ £ p tl (x,u)v t2 (u,v)p t Hv,y) = 0{(\ogVr 2 ). (7.21) 

u,vti,t 2 ,t3 : t\ + t 2 + t 3 >3 

We first handle the special case of (fi, f 2 > ^3) = (1> 1- !)• An immediate calculation with Lemma l3.11l 
gives that (on any regular graph of degree m) 

£p : (x, u)p l (u, v)p 1 {v,y) = 0(l/m 2 ). 

u,v 

In all other cases of {t\, t 2 , W we use our bound on p f, (x,y) for i e 1,2,3 such that t{ is the largest 
of fi, t2, t3 (which must be at least 2). We pull this bound out of the sum, and sum the other two 
terms over u and v to get a multiplicative contribution of precisely 1. The sum over [t\, t2, W such 
that 3 < t\ + ?2 + h < 15 is bounded by C(logV)" 2 since the number of such triplets is bounded, 
and each contributes at most C(logl^)" 2 because one of the ti's is at least 2, so that our bounds on 
p f (x,y) guarantee that for this U we have p f, (v) ^ 0(1/ m 2 ) < 0(1 /(log V) 2 ) by the assumption that 
m > clog V. Similarly, the sum over triplets Ui, h> ^3) such that 1 1 + t 2 + £3 > 15 and ti < tuq is also 
bounded by C(log V)~ 2 since the number of such triplets is at most C(log V) 3 , and each contributes at 
most C(log V)" 5 because at least one of the ti's is at least 5 and for this ti we have p fi (•, •) < C(log V)" 5 
again by our assumption that m > clog V. This concludes our verification of conditions (2) and (3) of 
Theorem ll.3l and concludes our proof. □ 

8. Open problems 

(1) In this paper we prove a law of large numbers for 1^1 above the critical window for percolation 
on the hypercube. Show that l^i | satisfies a central limit theorem in this regime. In G{n, p) this 
and much more was established by Pitt el and Wormald 1 49 1 . 

(2) Showthatl^zl = (2+o(l))c _2 log(£ 3 2 m ) when p = p c (l+e) and that l^il = (2+o(l))£- 2 log(c 3 2 w1 ) 
when p = p c {\ - e) for e » V~ 113 and e = o(l). This is the content of [T7J Conjectures 3.1 
and 3.3]. In [14] this is proved for e > 60{\ogn) 3 / n in the supercritical regime, and for e > 
(logn) 2 /(n 1/2 loglogn) in the subcritical regime. In G{n,p) these results are proved in |49| and 
1361 Theorem 5.61. 
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(3) Show that (|Sf/|2~ 2m/3 )j>i converges in distribution when p = p c (l + t2~ ml3 ) and teUis fixed 
and identify the limit distribution. Up to a time change, this should be the limiting distribu- 
tion of {\^j\n~ 2l3 )j>i in G{n,p) with p = (1 + tn~ ll3 )/n identified by Aldous Q]. 

(4) Consider percolation on the nearest-neighbor torus Z d where d is a large fixed constant and 
n — oo with p = p c {l + e) such that e » n~ ande = o(l). Show that \ c ^i\/(en d ) converges to a 
constant. Does this constant equal the limit as e | of e~ 1 6 Z d {p c 0- + £))? Here Z d {p) denotes 
the probability that the cluster of the origin is infinite at p-bond percolation on the infinite lat- 
tice J- d . The techniques of this paper are not sufficient to show this mainly because condition 
(2) of Theorem ll.3l does not hold in (in fact, it is easy to see that p c - [2d - l)" 1 > c> for 
some positive constant c = c{d) — this is always the case when our underlying transitive graph 
has constant degree and short cycles). The critical regime of this graph is well understood by 
the works fT5iri6lf28lf29l. 

(5) Show that the finite triangle condition 11.51 holds on any family of expander graphs. 

(6) Let 6 > be a fixed constant and consider the giant component ^ obtained by performing 
percolation on the hypercube with p = (\ + 6)lm. Show that whp the mixing time of the simple 
random walk on is polynomial in m. Is this mixing time of order m 2 ? This is what one 
expects by the analogous question on G{n, p), see (9j[25) • Further analogy with the near-critical 
G{n,p) (see |21|) suggests that whp the mixing time on <€\ when p = p c [l + e) with the usual 
condition that £ » 2~ ml3 and e = o(l) is of order £~ 3 log(£ 3 2 m ). 

Appendix A. Asymptotics of the super-critical cluster tail 

Our goal in this section is to prove Theorem l2.21 In 1 15], Theorem l2.2l is proved without the precise 
constant 2. Here we sharpen this proof to get this constant. We assume that G is a general transitive 
graph having degree m and volume V satisfying the finite triangle condition 11.5II . In order to stay 
close to the notation in 1 15 1, we define 

V^ ax = supV p (x,y), 

and 

Tp(x) = P p (0 — x). (Al) 

Proposition A.1 (Upper bound on the cluster tail). Let G be a finite transitive graph of degree m on 

V vertices such that the finite triangle condition fi.5D holds and put p = p c (l + e) where e = o(l) and 
e » V~ 113 . Then, for every k = k E satisfyingk E > e~ 2 , 

P p (|«tf (0)| > k) < 2e{l + 0(e+{e 3 V)~ 1 + {e 2 k)~ 114 + a m )). (A.2) 

Proposition A.2 (Lower bound on the cluster tail). Let G be a finite transitive graph of degree m on 

V vertices such that the finite triangle condition fi.5D holds and put p = p c {\ + e) where e = o(l) and 
e » V~ 113 . Then, for every a e (0, 1/3), there exists a c = c{a) > such that 

P p (|^(0)| > <T 2 (rV) a ) > 2e{\ + 0{e + (£ 3 V)~ c + a m )). (A3) 

Remark. The above propositions apply also to infinite transitive graphs (where {e 3 V)~ c is replaced by 
0), assuming that 111.51 1 holds with %{p) 3 IV replaced by 0. 



Proof of Theorem l2.2l This proof follows immediately from the above propositions. 



□ 
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A.l. Differential inequalities. We follow (T5j Section 5]. For p,j e [0, 1], we define the magnetization 

by 

v 

M(p/r) = £[l-(l-y)*]P p (|W)| = fc). (A.4) 

fe=l 

For fixed p, the function y >->• M(p, y) is strictly increasing, with M(p, 0) = and M{p, 1) = 1. When we 
color all vertices independently green with probability y , and we let ^ denote the set of green vertices, 
then 1A..4I has the appealing probabilistic interpretation of 

M(p,y) = P p>r (0 — m, (A.5) 

where P p , r is the probability measure of the joint bond and site percolation model, where bonds and 
sites have an independent status. This representation is important for the derivation of useful differ- 
ential inequalities involving the magnetization. 

Lemma A.3 (Differential inequalities for the magnetization). Let G be a finite transitive graph on V 
vertices and degree m. Then at any p, y e (0, 1) 

dM dM 
a-p)— <m(l-y)M— , (A.6) 
op ay 

dM A o . r l . dM 

M<y-^— + [-mpM +jM\ + [-mpM + j\p-^— , (A.7) 

and 

r 1 7 y,dM 

M> mp[Y+ (l-y)-m(ra- Y)p a(p)M^\— , (A.8) 

where 

a(p) = {\-2pf -{\ + mp + 2{mpf)^^-mpM-{mpfM 2 . (A.9) 

The inequality iA.6i is proved in (T) , where it was used to prove the sharpness of the percolation 
phase transition on Z d , and was first stated in the context of finite graphs in |15, (5.14)]. The differen- 
tial inequality in JA.7I) is an adaptation of another differential inequality proved and used in 1 1 1 , which 
is improved here in order to obtain sharp constants in our bounds. The bound in JA.8b is an adap- 
tation of [151 (5.16)], which was used there in order to prove an upper bound on M{p,j). Again, the 
inequality is adapted in order to obtain the optimal constants. We will first use Lemma lA3l to obtain 
Propositions|AT] and|A2l 

A.2. The magnetization for subcritical p. We take p = p c {\ - e) with e = o(l) and e 3 V » 1, and we 
take y = o(l). Then, [151 Lemma 5.3] shows that M(p,y) = 0[y/f). The main aim of this section is to 
improve upon this bound, using the improved differential inequality in 1A81 I . 

We have that M{p,y) = 0{yjj) and %(p) = 0(l/e) by [151 Theorem 1.5]. Further more, assumption 
ITESI gives that V max = 0(a m + (e 3 V)" 1 ) and QUI (1.30)] then implies that mp<l + 0(a m ). Putting all 
this into I IA.9I I yields 

a(p)>l + 0(v?+(£ 3 V)- 1 + a m ). (A.10) 
Substituting I IAIOII into I IA.8I) in turn gives that 

M > (1 + 0(v?+ (£ 3 V)" 1 + a m ))[ r + ^M 2 }^-. (All) 

We now use this to prove the following lemma: 

Lemma A.4 (Upper bound on the slightly subcritical magnetization) . Let G be a finite transitive graph 
of degree monV vertices such that the finite triangle condition (l.SH holds. Lety = o(l) and put p - 
p c {l - e) with e = o(l) and£ 3 V » 1. Then, 

M(p, T ) < y/2r[l + 0(v?+ OrVr 1 + a m )). (A.12) 
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A similar bound as in Lemma lA~4l was proved in [151 Lemma 5.3], whose proof we adapt here, with 
y/2j replaced with y/l2j, and a less precise error bound. The precise constant \fl is important for us 
here as it relates to the constant 2 for the 2e{\ + o(l)) survival probability. 

Proof. We note that I IA.lll l implies that 

B JM 

M>-M 2 — , (A.13) 
2 dj 

where we abbreviate 5 = 1 + 0[\fj+ (e 3 ^)" 1 + a m ). Therefore, 

d[M 2 ] 

— -<4/B. (A.14) 

Integrating between and y, and using that M{p, 0) = yields that 

M 2 <4y/B, (A. 15) 

so that M < y/f{2/ \f~B~). Now, when we have this inequality, we can further bound 

B n 

y>-M 2 , (A. 16) 



so that by HA. 1 11 1 we get 
Performing the same integration steps, we arrive at 



M>B 1/2 + B/4 M 2 . (A.17) 

oj 



? 2 

M 2 < —v. (A.18) 

Therefore, the constant has become a little better (recall that B is close to 1). Iterating these steps 
yields that, for every k > 1, 

7 2 

M 2 < — r. (A. 19) 

Z k l=l {BI2V 

We prove dA. 19b by induction on k, the initialization for k = 1,2 having been proved above. To advance 
the induction hypothesis, suppose that dA. 19b holds for k > 1. Define 

k 

A k =Y J (B/2)i, (A.20) 
Z=l 

so that HA. 1911 is equivalent to M 2 < 2j/A k . In turn, this yields that y > A k M 2 12, so that 



M>B 

which in turn yields that 



Afc/2 + 1/2 



M 2 , (A.21) 

dj 



2 2 

M < r. (A.22) 

B[l + A k ]l2 V 

Note that 

B[l + A k ]/2 = A k+l , (A.23) 
which advances the induction. By 1A.191 . we obtain that 

M2s lS^7iiT r = 2[2 ^ lr/B ' <A ' 24) 

Finally the fact that 

[2-B]/B = l + 0{^/r+(e 3 V)~ 1 + a m ) (A.25) 

completes the proof. □ 
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A.3. The magnetization for supercritical p. In this section, we use extrapolation inequalities to ob- 
tain a bound on the supercritical magnetization from the subcritical one derived in Lemma lA"!4l Our 
precise result is the following: 

Lemma A.5 (Upper bound on the slightly supercritical magnetization). Let G be a finite transitive 
graph of degree m onV vertices such that the finite triangle condition (1.50 holds and put p = p c [l + e) 
where e-o{l) ande^> V~ 113 . Then for any ce (0,1/3), 



M(p, T ) <(e+ ^2y + £ 2 )(l + 0{e + ^f+ (e 3 V)~ c + a m )). (A.26) 

Proof We follow the proof in [15j Section 5.3], paying special attention to the constants and error 
terms. Indeed, we use 1A.61 and the chain rule to deduce that, with A = (1 - 2p c )~ l , and M(p, h) = 
M(p,l-e~ h ), 

dM - dM 

< mAM . (A.27) 

op oh 

Take Pi = (p c (l + £),h) and write m\ = M{P\). Further, take r\ = e{e 3 V)~ c for some c e (0, 1/3), so that 
rj = o{e) and rj 3 V ^ oo, and take P 2 = (p c (l -n), Amis'), where 

h 

e' = E + r> + . (A.28) 

Ami 

Then, with m 2 = M{P 2 ), we have that m 2 > mi (see e.g., (T5l (5.46)]). Therefore, by Lemma IA.4I and 
again writing B = 1 + 0[^/j+ (e 3 V)~ 1 + a m ) withy = 1 -e~ h , 



M{p, 1 - e~ h ) = mi < m 2 < V 25(1 - e - Am ^') (A.29) 



(1 + O(mie')) \JlABmie 1 



(1 + O(mie)) \2ABm x {e + r])+2Bh 



= {\ + 0{E+{e 3 V)~ c ))\/2ABmiE + 2Bh, 
where in the last inequality we use that r\ = e{e 3 V)~ c « e and mi < 1. The inequality 



mi < \]2ABm x e + 2Bh 

has roots 



m ± = ABe + \/2Bh + (AB£) 2 . (A.30) 
Since mi > and m+ > while m_ < 0, we deduce that 



M(p c + £/m,l-e~ h ) = mi< (1 + 0(e + (e 3 V)~ c )){ABe+ \/2Bh + (ABe) 2 ). (A.31) 

We have that y= I - e~ h = h(l + 0(h)) and A = l + 0(a m ) (by 1 15, (1.30)]) and 5 = 1 + 0(/f+ (e 3 ^)" 1 + 
a m ) . Putting all this together in the last inequality completes the proof. □ 

Proof of Proposition lATTl We note that, for any I > k > 1 and a > 0, 

l-(l-a/fc)'> l-e" fl . (A.32) 

Therefore, by (KM , 

PpG* (0)| > k) < [1 - e~ a ]~ l M{p, alk). (A.33) 

Recall that k » e" 2 and take a = (£ 2 /c) 1/2 so that alk = £ 2 {£ 2 k)~ 112 = o(e 2 ). We note that for y = 
£ 2 {£ 2 k)~ 112 , ED reduces to 

M(p,j) <2e(1 + 0(e + (E 3 Vy 1 + (£ 2 kr 114 + a m )). (A.34) 
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Then, by E34l and the fact that 1 - e~ a = 1 + o{{£ 2 kT 114 ), 

M{p, alk) < 2e(l + 0(e+ (eV)" 1 + (e 2 /c) _1/4 + a m )). (A.35) 
This completes the proof of Proposition lA.ll □ 

A.4. Lower bound on tail probabilities. In the remainder of this section, we shall prove Proposition 
IA.2I Throughout this proof, we will take p = p c [l + e). 

We shall assume that with /c = e~ 2 {e 3 V) a » e~ 2 and a e (0, 1/3), there exists b w = bi {a) such that 

P p {\<g[v)\ > £- 2 (s 3 V) a ) > b w e. (A.36) 

The bound in 1A.36I ) is proved for finite graphs in [15j Theorem 1.6(i)] and in |8|, in conjunction with 
[271 , on infinite lattices satisfying the triangle condition. The proof of HA.36I I is similar to the argument 
we shall give for the improved bound, and shall be omitted here. In turn, flA.361 implies that, for 
y = l/k = e 2 {e 3 V)~ a = o(e 2 ), there exists a constant b\a such that 

M(p,j) > [1 - [1 - j] k °]V p {\^{v)\ > ko) > b w e. (A.37) 

Equation flA.371 will be an essential ingredient in our proof. We start by proving the following lemma: 

Lemma A.6 (Lower bound on the magnetization). Let G be a finite transitive graph of degree m on 
V vertices such that the finite triangle condition (1.5H holds and put p = p c {\ + e) where e = o(l) and 
£» V~ 113 . Then, for j = e 2 {e 3 V)~ a withae (0,1/3) andanyc< 1, 

M{p,y) >2e[l + 0[e + (e 3 Vr c + a m )\. (A.38) 

Proof. Throughout the proof, we fix a £ (0, 1/3). We recall the differential inequality I IA.71 I 

dM A o . r l . dM 

M <r + \-mpM + jM\ + \-mpM + Y\p . (A.39) 

dj 2 2 dp 

By HA.37I I. and the fact that j <-> M{p,j) is increasing, for any y = e 2 {e 3 V)~ a we have that j = 0{Me). 
Further, mp < 1 + 0(e + a m ), so that, for some A > 1 with A = 1 + 0{e + a m ) we obtain 

dM A ? A dM 

M<y + -M 2 + -Mp . (A.40) 

dj 2 2 dp 

We rewrite JA.40l> as 

I dM 1 d A 

- T*ir + -t- [ oP m - P ] > (A - 41) 

M dj j dp 2 

and integrate for y e [yo,yi] and p e [p , p{\, where yo = {8e) 2 {5 3 e 3 V)~ a . We note that JA.371) holds for 
Po = p c (l + £8) for any 6 = o(l) and y = yo- We further take 

p = p c (l+8e), Pl = p c a + e), y 1 = e aog(1/5)) Vo, (A.42) 

where a > 1 is chosen below. 

Then, as in (23 (5.57) and the argument below it] , by the fact that p>-> M{p, y) and y —>• M{p, y) are 
non-decreasing, 

, M(pi,Yi) , A 

< ( Pl - po) log — - + log(n /yo) I- Pi M( Pl , y i) - ( Pl - p )] . (A.43) 
M{p ,Yo) 2 

Now, 

log(yi/y ) = (log(l/5)) fl , (A.44) 

while, by Lemma lA.5l and JA.371 I . 

M( Pl ,Yi)<2e{l + 0[e + (e 3 Vr 1 )), M(p , T o) > b 10 6e, (A.45) 
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so that, for 6 > sufficiently small, 



log — fa^j < log(2e/(fci 5e)) < 2log(l/<5). (A.46) 



|p 1 M(p 1 ,y 1 )>p c a-<5)e[l-2(log(l/5)) 1 - fl 



Dividing I IA.43I I through by (log(l/5)) a , we arrive at 

(A.47) 

Recalling that p\ = p c (l + e) and that a > 1, as well as the fact that A = 1 + 0(e + a m ), this yields 

M(p,ri) > 2e[l + 0(£ + (logd/^) 1 " + a m )]. (A.48) 

Finally, note that 

ri = e ao g (i/5))« ro = e aoga/«)« (5£) 2 (5 3 £ 3 F) -« (A49) 

= e 2 {e 3 Vr a {e aogaiS))a 6 2 - 3a ) > eVVT", 

when we take 8 = e - ^ 3 ^ 1 ° for any a > 1. Indeed, then e (i°g( 1/5 )) fl <>) 2 - 3Q: qq as § o. Since y >-► 
M{p,j) is increasing, this implies that 

M(p,y) > M(p, T i) > 2e[l + 0(e + (logd/S)) 1 ^ + a m )] =2e[l + 0{e+ ^V) lla ~ l + a m )]. (A.50) 

Denoting c = 1 - 1 / a, this proves the claim. □ 

Proof of Proposition lA.2l We use El (6.5) in Lemma 6.1], which states that, for any < yo,7i ^ 1» 

P p (|^(0)| > k) > M(p, T i) -—e rok M(p, To ). (A.51) 

To 

Now we take ji = e 2 {e 3 V)~ a ' with a' e (0, 1/3) taken as in Lemma [A6l Jq = e 2 (e 3 V)~ a with a < a', and 
= 1 /y . Then e r ° fc = e, while, by Lemma lABl and the fact that y = o[e 2 ), we obtain that 

M(p,y )<2£(l + o(l)). (A.52) 

Therefore, by Lemma lA.6l and 1A.51I . taking c = 1/2 in Lemma lATBl 

P p (|^(0)| > A:) > 2e(l + 0(e + {e 3 Vy 112 + ct m )) - (e 3 V) a '- a O(e). (A.53) 

We obtain that 

Pp(|^(0)| >e~ 2 {e 3 V) a ) >2e(l + 0{e+(e 3 Vr ll2 + (e 3 V) a ~ a ' + a m )). (A.54) 

This proves the claim in Proposition lA.2l with c = a - a' e (0, 1 /3) . □ 

A.5. Derivation of iA.8h We follow the proof in [15j Appendix A.2] as closely as possible, deviating in 
one essential inequality. Indeed, in |T5j (A.23-A.32)], it is proved that 

dM 

M(p, r ) > pm—(p,r)P p , r (0 o<£)-X 2 - X 3 . (A.55) 
aj 

We copy the bounds on X 2 and X 3 in [15j (A.46)] and El (A.53)] respectively, which prove that 

X 2 < p 2 mM 2 (p,y) — (p, r ), X 3 < ^pmM 2 {p,r) — {p,r), (A.56) 

and we improve upon the lower bound on Pp, r (0 o only. Our precise results is contained in the 
following lemma: 
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Lemma A. 7 (Improved lower bound on the double connection). For all p,j£ [0,1], 

P PiT (0 o <S) > r+ {l-j)^m{m-l)p 2 a(p)M 2 (p,r), (A.57) 

where 

a{p) = {\-2pf - (1 + rap + 2(rap) 2 )V™ ax - mpM{p,j) - {mp) 2 M{p,j) 2 . 

Proof. Note that if e ^, then occurs. Therefore, we obtain 

P p , r (0 o <g) = 7+ (1 - 7)Pp, r (0 o9|0?&). (A.58) 

Thus, we are left to obtain a lower bound on P p>r (0 o <^ | £ For this, we follow the original 
argument in [15, Section A.2], adapting it when necessary. 

For a directed bond b = (x, y), we write b = x and b = y for its top and bottom. Let e,f be two 
distinct bonds with e = f = 0, and let E e t be the event that the bonds e and / are occupied, and 
that in the reduced graph G~ = [V~ ,E~) obtained by removing the bonds e and /, the following three 
events occur: ~e ■> — ► C S, f < — ► C S, and n ^ (/) = 0. 

Let P~ r denote the joint bond/vertex measure on G". We note that the event {0 o <£} contains the 
event \J e tE e t, where the (non-disjoint) union is over unordered pairs of bonds e,f incident to the 
origin. Then, by Bonferroni's inequality and since E e t is independent of we get 

P Pi7 (0 o c S\0$. c S)> P P)T (l) e>f E eif \0&<g) > Y, p p,t(^,/) " Y i (A-59) 
= p 2 £p- y (? — <S, J~<S, <g(e)n<g(f) = 0)-Y h 

e,f 

where 

Y i = \ E Pp,r(E ei , fl nE e2if2 \0g^). (A.60) 

1 {ei,/i}/te,/ 2 } 

We first bound Y\, For this, we note that there are two contributions to Y\, depending on the number 
of distinct elements in {e\, f\,e 2 , f 2 }, which can be 3 or 4, and whose contributions we denote by Yi, 3 
and Yi )4 , respectively. 

We start by bounding F li3 . The number of pairs of pairs of edges ^ {62-/2} such that \{ei,fi,e 2 ,f 2 }\ = 

3 is m(m - l)(m - 2). For such a pair, let xi,x 2 ,x 3 denote the distinct elements of {ei,f l ,'e2,fzt such 
that X\ corresponds to the end of the edge that appears twice in {e\,f\,e 2 , f 2 }. If E ex j Y n E e2 j 2 occurs, 
then either 

{(0, JCi) OCC.} o {(0, X 2 ) OCC.} o {(0, X 3 ) OCC.} o {Xi < — ► <^} o {x 2 < — ► ^} o {x 3 < — ► 

occurs, or there exists a z such that 

{(0, Xi) OCC.} o {(0, X2) OCC.} o {(0, X3) OCC.} o {Xi < — ► ^} 
o {X 2 « ► Z\ o {x 3 < ► Z} o {z < ► 

occurs. Therefore, 

F 1>3 < (1 - T )im(m - l)(m- 2)p 3 M(p, r ) 2 [M(p, r ) + V™ 3 *] , (A.61) 

where we bounded 

£Pp(X 2 — Z)P p (X 3 — Z) < V™ 3 *, 

z 

which is wasteful, but sufficient for our purposes. 
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For Y" li4 , we sum over {ei,/i} ^ {^2-/2} with the constraint that all these edges are distinct. The 
number of such pairs of pairs is m(m - l)(m - 2)(m - 3)/4. Then, a similar computation as for Yi )3 
yields that 

Yi >4 < (1 - 7) ^ m( m - 1) (m - 2) (m - 3) p 4 M(p, y) 2 [M(p, jf + 8V™ "*] . (A.62) 
We continue to bound the sum over {e, /} in dA.591 ) from below. Let 

W = W eJ = {e< — ► <£, J < — * ( S, <g(e) n ^(/) = 0}, (A.63) 

denote the event whose probability appears on the right side of I IA.59I I . Conditioning on the set ^ (e) = 
Ac V~, we see that 

P P, r (W) = E Pp, r (^(e) = A e < — ► C S, J < — ► C S, <€(e) n <€{f) = 0). (A.64) 

A:/eA 

This can be rewritten as 

P P,r^)= E Pp iy i e e(e) = A,e~y,f~<SmV-\A), (A.65) 

A:/eA 

where {/ < — ► ^ in V~ \ A} is the event that there exists such that / < — ► x in V~ \ A. The inter- 
section of the first two events on the right hand side of (A65J is independent of the third event, and 
hence 

P P, r (W)= E Pp, r (^© = A, e < — ► <S) Pp, y (7 * — * ^ in V" \ A). (A.66) 

A:e£A 

Let M _ (x) = Pp T (x - — ► C S), for x e V~ . Then, by the BK inequality and the fact that the two-point 
function on G~ is bounded above by the two-point function on G, 

p p,r (7 " — " c SmV~\A) = AT (/) - Pp", r (f^ c S only on A) (A.67) 

>M"(7)- £r p (7, y)M-(y). 

ye A 

By definition and the BK inequality, 

M~(x) = M(p,y)-Pp )7 (eor/isocc. andpiv. forx< — ► (A.68) 
>M(p,y)(l-2p). 

It follows from dA.66D - jAr68l ) and the upper bound M" (x) < M(p, y) that 

p- iT {W) > M(p, r ) E Pp", r (^ra = a, e~ m[a-2 P )- E^(7.y)] 

A:ee"A ye A 

= M(p,y)[M-(e)(l-2p)- £ r p (7,y)p- r (?— y, ? — <^)]. (A.69) 

yeV~ 

It is not difficult to show, using the BK inequality, that 

P~ r (?< — >y,e< — £ T p (e,w)r p (w,y)M~(w), (A.70) 

and hence, by llA68l - llA69ll . 

p- iY (W)>M(p,y)[M-(e)a-2p)- £ r p (f,y)T p (e, w)T p {w t y)AT{w)] 

y, we y~ 

>M 2 (p,y)[(l-2p) 2 -V^ iax ]. 
This completes the proof of dA.571) . □ 
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A.6. Derivation of HA.7K In this section, we prove HA.7I I, which is an adaptation of the proof of the 
related inequality 

dM ? dM 

M <v + M z + pM , (A.71) 

dy op 

which is proved in |1| (see also (261 Lemma (5.53)]). The main difference between I IA.71I I and I IA.7I I is 
in the precise constants. Indeed, we have that pm»l and M » y, so that, I IA.7I I is morally equivalent 
to 

DM 1 o 1 dM 
M< T — + -M 2 + -pM—, (A.72) 
dy 2 2 op 

i.e., in the inequality in <A.7U the last two terms are multiplied by 1/2. 

We follow the proof of |26, Lemma (5.53)] as closely as possible, deviating only when necessary. 
Indeed, 

M{p, r ) = P P)7 mo) n # ? 0) = P PiT (1^(0) n <S\ = 1) + P PiT (|^(0) n »| > 2). (A. 73) 

The first term on the r.h.s. of 1A.731 equals y^M-, as derived in |26, (5.69)]. For the second term, we 
define A x to be the event that either xe^or that x is connected by an occupied path to a vertex 
ge<£. Then, 

P p , r (l^(0) n #| > 2) = P p , r (A, o A ) (A. 74) 

+ Pp, r (|^(0) n <g\ > 2, A o A does not occur). 

In the derivation of 1A.71I I, we simply apply the BK-inequality to obtain 

Pp, r (A oA ) < Pp, r (A ) 2 = M(p,yf, (A.75) 

leading to the second term in 1A.71I I. Instead, we split, depending on whether e ^ or not. If e C S, 
then 0e^ occurs disjointly from Aq, so that the BK-inequality yields 

Pp, r (A o A ,0 e m < Pp, r (A o {0 e <£}) < yP(A ) = yM{p,y). (A.76) 

When, instead, £ there must be at least two neighbors e of the origin for which the event 

A e oA o{(0,e) occ.} (A.77) 

occurs. Therefore, we can bound, with N denoting the number of neighbors e for which the event in 
dA.771 ) occurs, so that N > 2 a.s. and Markov's inequality yields 



Pp, r (A oA ,0^)< £Ep 



r 

e~0 



— U{A.°A)°{(0,e) occ.}} 



55 9 E p p,r ^0 o {(0, e) occ.}). (A.78) 

^ e~0 



Therefore, again by the BK-inequality 



Pp, r (A oA ,0^^) < -£Pp, r (A e )Pp jr (Ao)p= \pmM{p,yf, 

2 e 2 

so that 

1 2 

Pp, r (A oAo) < -pmM{p,yr + yM(p,y), (A.79) 

which yields the second term in HA.7I I. 

We move on to the bound on the probability of the event that 1^(0) n C S\ > 2, but that A ° A does 
not occur. This event is equivalent to the existence of an edge b = {x, y) for which the following occurs: 

(i) the edge b is occupied; and 
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(ii) in the subgraph of G obtained by deleting b, the following events occur: 

(a) no vertex of is joined to the origin by an open path; 

(b) x is joined to by an occupied path; 

(c) the event A y o A y occurs. 

The events in (ii) are independent of the occupation status of the edge b = {x, y) so that 

P p , r (|^(0) n #| > 2, A o A Q does not occur) (A.80) 
= — — Y, Pp, r ((x, y) closed, x e ^(0),^(0) n <3 = 0, A y o A y ) 

1 — P x~y 

< -3- E p P.r (x^(O)^(O)n^ = 0,^oA y ), 

1 P x~y 

where we write x ~ y to denote that (jc, y) is a bond. We condition on ^ (0) to obtain 

Pp, r (<#(O)n^ = 0,A y oA y ) (A.81) 
= -T— E E P P^(°) = A)P p , r (^(0) = 0, A y o A y I <£(0) = A), 

where the sum over A is over all sets of vertices which contain and x but not y. Conditionally on 
<t?(0) = A, the events ^(0) n ^ = and A y o A y are independent, since ^(0) n ^ = is defined on the 
vertices in A, while A y ° A y depends on the vertices in A c and the edges between them. Thus, 

P p>r (W) n c S = 0,A y oA y \ ^(0) = A) (A.82) 
= P p , r (^(0) n <S = I ^(0) = A)P p , r [A y o A y off A), 

where we write {A y ° A y off A} for the event that A y o A y occurs in the graph where all edges with at 
least one endpoint in the set A are removed. So far, the derivation equals that in the proof of |26, 
Lemma (5.53)] . Now we shall deviate. We split, depending on whether y e <£ or not, to obtain 

P p , T {Ay O Ay Off A) = V p>J {Ay O Ay Off A, V £ ^) + Pp^ (Ay O A y Off A, J/ £ ^) . 

When y e <£, 

{A y o Ay off A, y e ^} = {({y e <S] o A y ) off A}, (A.83) 

so that, by the BK- inequality, 

Pp, r {A y o A y off A, y G < yPp )7 ( A y off A) . (A.84) 



As a result, 



P_ 

P 

TP 



£ Pp, r (xG^(0),^(0)n^ = 0,A y oA y ,ye^) (A.85) 

1 — P x~y 



_ X E P P^°) = A)Pp )T (^(0) = I ^(0) = A)V m {A y off A) 

12- X E P P^(°) = A)P p>r m0) C\<£ = 0,A y I <£(0) = A) 
1 P x~y A 

YD ^ dM 

E p p,r ( x e ^(0),^(0) n <3 = 0, c €{y) n<S^0)= jp—, 



1-p^y dp 

where the first equality follows again by conditional independence, and the last equality by the fact 
that (see ES (5.67)]) 

{l-p)—= ^Pp, r (xe^(O),^(O)n^ = 0,<tf(y)n^^0). (A.86) 

OP x~y 
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We are left to bound the contribution where y For this, we note that when A y o A y occurs off A 
and yt^, then there must be at least two neighbors z of y for which the event 

\[A z o A y o{{y,z) occ.}) off A} (A.87) 

occurs. Therefore, by a similar argument as in JA.78I I. 

V PiJ {A y oA y off A,yt&) * \ E PpJ(A z oA y o{ty,z) occ.}) off A (A.88) 

By the BK-inequality, 

Pp, r ((Az oA y o {(y, z) occ.}) off a) < pP p , r {A z off A)P p , r (A y off A) 

< pM(p,r)P PiT (Ay off A). 

Repeating the steps in HA.85K we thus arrive at 

T 2 — E P p , r (xe^(0),^(0) n<d? = 0,A y o A y ,y £^) < -mp 2 M{p,j)—. (A.89) 

1-Px~y 2 dp 

Therefore, summing the two bounds in JA.851 ) and lfA89j, we arrive at 

r l . dM 

P pr (|^(0) n^| >2,A oA does not occur) < \-mpM{p,r)+j\p——, 

2 op 

which is the third term in 1A71 . This completes the proof of JA.7II . □ 
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